Abstract

AutoMap  is  software  for  computer-assisted  Network  Text  Analysis  (NTA).  NTA  encodes 
the  links  among  words  in  a  text  and  constructs  a  network  of  the  linked  words.  AutoMap 
subsumes  classical  Content  Analysis  by  analyzing  the  existence,  frequencies,  and 
covariance  of  terms  and  themes. 
AutoMap: An Overview

AutoMap  is  a  software  tool  to  analyze  text  using  the  method  of  Network  Text  Analysis.  It 
performs  a  specific  type  of  Network  Text  Analysis  called  Semantic  Network  Analysis. 
Semantic analysis extracts and analyzes links among words to model an author's “mental
map” as a network of links. Additionally, AutoMap supports Content Analysis.

Coding  in  AutoMap  is  computer-assisted;  the  software  applies  a  set  of  coding  rules 
specified  by  the  user  in  order  to  code  the  texts  as  networks  of  concepts.  Coding  texts  as 
maps  focuses  the  user  on  investigating  meaning  among  texts  by  finding  relationships 
among  words  and  themes. 

The  coding  rules  in  AutoMap  involve  text  pre-processing  and  statement  formation,  which 
together  form  the  coding  scheme.  Text  pre-processing  condenses  data  into  concepts, 
which  capture  the  features  of  the  texts  relevant  to  the  user.  Statement  formation  rules 
determine  how  to  link  concepts  into  statements. 

Listed  below  are  the  steps  a  user  would  follow  (in  typical  order)  to  use  AutoMap: 

1.  Pre-Process  texts. 

2.  Perform  Semantic  Network  Analysis  on  texts. 

3. Run MetaMatrix Text Analysis and Sub Matrix Text Analysis (both techniques
are sub-types of Map Analysis).

4. Compare Maps generated with AutoMap.

5. Compute network analytic measures per text and per word.

Network  Text  Analysis  (NTA) 

NTA  theory  is  based  on  the  assumption  that  language  and  knowledge  can  be  modeled  as 
networks  of  words  and  relations.  Network  Text  Analysis  encodes  links  among  words  to 
construct  a  network  of  linkages.  Specifically,  Network  Text  Analysis  analyzes  the 
existence,  frequencies,  and  covariance  of  terms  and  themes,  thus  subsuming  classical 
Content  Analysis. 

Semantic  Network  Analysis 

In  map  analysis,  a  concept  is  a  single  idea,  or  ideational  kernel,  represented  by  one  or 
more  words.  Concepts  are  equivalent  to  nodes  in  Social  Network  Analysis  (SNA).  The 
link between two concepts is referred to as a statement, which corresponds to an edge
in SNA. The relation between two concepts can differ in strength, directionality, and
type. The union of all statements per text forms a semantic map. Maps are equivalent to
networks.
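
As a rough illustration of these terms, the Python sketch below stores concepts as nodes and statements as weighted, directed links, and forms a map as the union of all statements found in one text. The rule used here to form statements (each concept is linked to the concept that directly follows it) and the name build_map are assumptions made for this example; they are not AutoMap's actual coding rules.

```python
from collections import Counter

def build_map(concepts):
    """Return a semantic map as {(source, target): strength}.

    Assumption for this sketch: a statement links each concept to the
    concept that directly follows it; strength counts repetitions.
    """
    statements = Counter()
    for source, target in zip(concepts, concepts[1:]):
        statements[(source, target)] += 1
    return dict(statements)

# A short, already pre-processed list of concepts (hypothetical input).
text_concepts = ["dog", "ate", "lotus", "dog", "ate", "roses"]
semantic_map = build_map(text_concepts)

# The nodes of the map are all concepts that take part in a statement.
nodes = sorted({concept for pair in semantic_map for concept in pair})
```

Here the statement from "dog" to "ate" occurs twice, so it is stored once with a strength of 2, while the direction of each link is kept by the order of the pair.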
Social Network Analysis (SNA)

Social  Network  Analysis  is  a  scientific  area  focused  on  the  study  of  relations,  often 
defined  as  social  networks.  In  its  basic  form,  a  social  network  is  a  network  where  the 
nodes  are  people  and  the  relations  (also  called  links  or  ties)  are  a  form  of  connection  such 
as  friendship.  Social  Network  Analysis  takes  graph  theoretic  ideas  and  applies  them  to 
the  social  world.  The  term  “social  network”  was  first  coined  in  1954  by  J.  A.  Barnes  (see: 
Class  and  Committees  in  a  Norwegian  Island  Parish).  Social  network  analysis  is  also 
called  network  analysis,  structural  analysis,  and  the  study  of  human  relations.  SNA  is 
often  referred  to  as  the  science  of  “connecting  the  dots.” 

Today,  the  term  Social  Network  Analysis  (or  SNA)  is  used  to  refer  to  the  analysis  of  any 
network  such  that  all  the  nodes  are  of  one  type  (e.g.,  all  people,  or  all  roles,  or  all 
organizations),  or  at  most  two  types  (e.g.,  people  and  the  groups  they  belong  to).  The 
metrics  and  tools  in  this  area,  since  they  are  based  on  the  mathematics  of  graph  theory, 
are  applicable  regardless  of  the  type  of  nodes  in  the  network  or  the  reason  for  the 
connections. 

For most researchers, the nodes are actors. As such, a network can be a cell of terrorists,
the employees of a global company, or simply a group of friends. However, nodes are not
limited to actors. A series of computers that interact with each other or a group of
interconnected libraries can also form a network.

Where  to  find  out  more  on  SNA? 

• Scott, John, 2000, Social Networks, Sage (2nd edition)

•  Wasserman,  S.  &  K.  Faust,  1994,  Social  Network  Analysis:  Methods  and 
Applications. 

Dynamic  Network  Analysis 

Dynamic  Network  Analysis  (DNA)  is  an  emergent  scientific  field  that  brings  together 
traditional  social  network  analysis  (SNA),  link  analysis  (LA)  and  multi-agent  systems 
(MAS).  There  are  two  aspects  of  this  field.  The  first  is  the  statistical  analysis  of  DNA 
data.  The  second  is  the  utilization  of  simulation  to  address  issues  of  network  dynamics. 
DNA networks vary from traditional social networks in that they are larger, dynamic,
multi-mode, multi-plex networks, and may contain varying levels of uncertainty.

DNA statistical tools are generally optimized for large-scale networks and admit the
analysis of multiple networks simultaneously, in which there are multiple types of nodes
(multi-mode) and multiple types of links (multi-plex). In contrast, SNA statistical tools
focus on single-mode or at most two-mode data and facilitate the analysis of only one type of
link at a time.
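
The difference between a one-mode, single-link network and a multi-mode, multi-plex network can be sketched as a data structure. The example below is only an illustration: the node names, link types and helper functions are hypothetical and do not come from any DNA tool.

```python
from collections import defaultdict

# Each node belongs to a mode (node type).
node_mode = {
    "alice": "agent",
    "bob": "agent",
    "welding": "knowledge",
    "plant": "location",
}

# links[link_type] is a set of (source, target) pairs: one network per link type.
links = defaultdict(set)
links["friendship"].add(("alice", "bob"))
links["advice"].add(("bob", "alice"))
links["knows"].add(("alice", "welding"))
links["works_at"].add(("bob", "plant"))

def modes_in(link_type):
    """Return the set of node modes touched by one link type."""
    return {node_mode[node] for pair in links[link_type] for node in pair}

def is_one_mode(link_type):
    """True if the network for this link type contains a single node type."""
    return len(modes_in(link_type)) == 1

multi_mode = len(set(node_mode.values())) > 1  # several node types
multi_plex = len(links) > 1                    # several link types
```

In this sketch the "friendship" network alone is the kind of one-mode data that traditional SNA tools handle, whereas the collection as a whole is multi-mode and multi-plex.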

DNA  statistical  tools  tend  to  provide  more  measures  to  the  user,  because  they  have 
measures that use data drawn from multiple networks simultaneously. From a computer
simulation perspective, nodes in DNA are like atoms in quantum theory: they can be,
though need not be, treated as probabilistic. Whereas nodes in a traditional SNA model
are static, nodes in a DNA model have the ability to learn. Properties change over time,
and nodes can adapt: a company's employees can learn new skills and increase their value to
the network, or, if one terrorist is killed, three more are forced to improvise. Change
propagates  from  one  node  to  the  next  and  so  on.  DNA  adds  the  critical  element  of  a 
network's  evolution  and  considers  the  circumstances  under  which  change  is  likely  to 
occur. 


Illustrative problems that people in the DNA area work on:

1. Developing metrics and statistics to assess and identify change within and across
networks.

2.  Developing  and  validating  simulations  to  study  network  change,  evolution, 
adaptation,  decay... 

3.  Developing  and  validating  formal  models  of  network  generation  and  evolution. 

4.  Developing  and  testing  theory  of  network  change,  evolution,  adaptation,  decay... 

5.  Developing  techniques  to  visualize  network  change  overall  or  at  the  node  or 
group  level. 

6.  Developing  statistical  techniques  to  see  whether  differences  observed  over  time  in 
networks  are  due  to  simply  different  samples  from  a  distribution  of  links  and 
nodes  or  changes  over  time  in  the  underlying  distribution  of  links  and  nodes. 

7.  Developing  control  processes  for  networks  over  time. 

8.  Developing  algorithms  to  change  distributions  of  links  in  networks  over  time. 

9.  Developing  algorithms  to  track  groups  in  networks  over  time. 

10.  Developing  tools  to  extract  or  locate  networks  from  various  data  sources  such  as 
texts. 

11.  Developing  statistically  valid  measurements  on  networks  over  time. 

12.  Examining  the  robustness  of  network  metrics  under  various  types  of  missing  data. 

13.  Empirical  studies  of  multi-mode  multi-link  multi-time  period  networks. 

14.  Examining  networks  as  probabilistic  time-variant  phenomena. 

15. Forecasting change in existing networks.

16. Identifying trails through time given a sequence of networks.

17. Identifying changes in node criticality given a sequence of networks.

18. Anything else related to multi-mode multi-link multi-time period networks.


Where to learn more:

• Kathleen M. Carley, 2003, “Dynamic Network Analysis” in Dynamic Social
Network Modeling and Analysis: Workshop Summary and Papers, Ronald
Breiger, Kathleen Carley, and Philippa Pattison (Eds.), Committee on Human
Factors, National Research Council, pp. 133-145, Washington, DC.

• Kathleen M. Carley, 2002, “Smart Agents and Organizations of the Future” in The
Handbook of New Media, edited by Leah Lievrouw and Sonia Livingstone, Ch.
12, pp. 206-220, Thousand Oaks, CA, Sage.

• Kathleen M. Carley, Jana Diesner, Jeffrey Reminga, Maksim Tsvetovat, 2005
(forthcoming), Toward an Interoperable Dynamic Network Analysis Toolkit, DSS
Special Issue on Cyberinfrastructure for Homeland Security: Advances in
Information Sharing, Data Mining, and Collaboration Systems.

AutoMap Graphical User Interface


AutoMap's graphical user interface (GUI) is divided into four primary quadrants (or
panels), referred to as P1 (top left), P2 (bottom left), P3 (top right) and P4
(bottom right).

The  drop-down  menu  bar  provides  access  to  various  analysis  tools  and  utilities.  The 
browse  menu  bar  allows  you  to  quickly  navigate  between  loaded  texts. 

Index cards, or “tabs”, provide a tabbed interface that lets you navigate within each
panel. The GUI reads changes from the XML file dynamically; for a change to take effect, the
user needs to refresh the tool.

The screen shot below highlights the primary features of the AutoMap GUI and where to find
them:


[Screen shot: AutoMap GUI, with callouts for Menus, Browse Menu, Index Cards, and Panel]


It is important to note that P2 can be edited. The other panels (P1, P3, P4) cannot be
edited. The information displayed in P1 through P4 is always related. The Text Browse
Menu applies to all panels at the same time.

Window sizes do not have an upper threshold. On the user's request, AutoMap will
automatically set the window size to the size of the largest text, using a button on the
Analysis Settings panel. This enables maximum window sizes specific to a text set, which
also enhances efficiency.

The Action Tracer Panel in P4 logs the pre-processing utilities applied to your text. This is
a handy way to keep track of changes and actions relating to your text.

Tool  Tips 

In the various pre-processing panels, such as Utilities, “tool tips” provide more
information on certain routines. A tool tip becomes visible when you move the mouse over
the corresponding tool.


Load  Input 

1.  To  open  a  single  text 

To open a single text file in AutoMap, proceed as follows from the AutoMap menu bar:

File menu > Open single file

A file chooser will pop up (screen shot below).


Double click on the file that you wish to analyze > select the Open button.
The text will be displayed in P1 on tab No. 1, Original Text.

Files with an upper-case .TXT extension can now be loaded as well.
2. To open a set of texts

To analyze multiple texts at the same time, store them in one folder. Then proceed as
follows from the AutoMap menu bar:

File menu > Open multiple files

After you select “Open multiple files”, a folder chooser will pop up. Be sure that the
correct folder is selected in the folder chooser. The black ellipses in the screen shot below
highlight where you should be looking in the folder chooser:


Text  Examples 

This  user's  guide  provides  illustrative  examples  for  all  AutoMap  functions.  The  sample 
texts  below  will  be  used  throughout  this  guide. 

Tip! It is suggested that you follow along using the examples below. To do so, simply copy
and paste the texts below into WordPad or Notepad and save each as a .txt file in the same
folder.
Our Text Example

Mr.  Cray's  brown  dog  ate  the  lotus  blossom  at  10  am.  Mrs.  Brown  was 
unhappy  with  the  dog.  She  yelled  at  it  saying  "You  impossible  dog!"  But 
the  dog  kept  eating  the  flowers  and  weeds.  She  asked  Mr.  Cray  to  stop 
the  dog.  He  couldn't. 

Mrs.  Brown  planted  roses  and  weeded  the  garden.  The  silly  dog  %  dug  up 
the  roses  looking  for  a  vole  on  June  12,  1880.  Weeding  was  no  longer 
needed.

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  the  next  day  to  concoct  a  plan. 
John  Darren  and  Mrs.  Brown  put  up  a  scarecrow.  She  thought  it  would 
scare  the  dog.  Mr.  Cray  put  up  a  fence.  Problem  solved.  Then  Mrs.  Brown 
planted  lotus,  carnations,  daffodils,  and  roses. 

Our  Text  Example  II 

Mr.  Cray's  brown  dog  stopped  eating  the  lotus  blossom  at  12  pm  the  next 
day.  Mrs.  Brown  was  now  happy  with  the  dog.  She  said  "You  good  dog!" 

The  dog  no  longer  ate  the  flowers  and  weeds.  Mr.  Cray  was  pleased  too. 
Mrs. Brown watered the roses and fertilized the garden on June 13,
1880.

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  over  dinner  and  discussed  how 
the  plan  had  worked.  John  Darren  and  Mrs.  Brown  would  take  down  the 
scarecrow  the  following  week.  She  thought  it  was  too  scary  for  the  dog. 
Mr.  Cray  painted  his  fence.  Then  Mrs.  Brown  watered  lotus,  carnations, 
daffodils,  and  roses. 
Choose the folder that contains the texts you wish to analyze and single click on it. The
folder will be highlighted. Do not double click on the folder. Select the Open button (see
above screen shot). The first text will be displayed in panel P1, tab no. 1, Original Text.
You can browse through the texts by using the Browse Menu.


[Screen shot callout: the Browse Menu allows you to quickly navigate between text samples.]
Parts of Speech


The Parts of Speech feature does what its name suggests: when text is loaded into AutoMap,
this feature shows you the part of speech of each word.

To  use  the  feature: 

•  In  AutoMap  menu  bar,  select  File  >  Open  single  file.  A  folder  chooser  window 
will  pop  up. 

•  Navigate  to  the  folder  containing  the  texts  you  wish  to  analyze,  and  single 
click  on  it. 

•  Load  your  text  file  and  it  will  appear  in  Panel  1  -  Original  Data. 

•  Next  go  to  Panel  2  and  select  3.  Pre-Processing  Settings. 

•  Once  in  the  Pre-Processing  Settings,  select  1.  Utilities. 

•  Scroll  down  the  options  and  single  click  on  Tag  Texts  and  save  Tagging 
Output,  which  is  under  the  heading  Parts  of  Speech  Tagging. 

NOTE: Data has to be in txt format with a .txt extension.

[Screen shot: Panel 2 tabs (1. Concept List, 2. Union Concept List, 3. Pre-Processing Settings, 4. Analysis Settings) and the Pre-Processing Settings sub-tabs (1. Utilities, 2. Stemming, 3. Delete List, 4. Generalization Thesaurus, 5. Meta-Matrix Thesaurus, 6. Sub-Matrix Selection)]


• The Tag Texts option that you have just selected takes your text document
and tags each word with its part of speech.

•  Go  back  to  Panel  1  and  select  8.  Texts  after  Parts-of-Speech  Tagging.  This 
option  will  show  you  your  document  with  the  different  parts  of  speech. 
Mr/NNP Cray's/NNP brown/JJ dog/NN ate/VBD the/DT lotus/NN
blossom/NN at/IN 10/CD am/VBP Mrs/NNP Brown/NNP was/VBD
unhappy/JJ with/IN the/DT dog/NN She/PRP yelled/VBD at/IN it/PRP
saying/VBG "You/NN impossible/JJ dog!"/NN But/CC the/DT dog/NN
kept/VBD eating/VBG the/DT flowers/NNS and/CC weeds/NNS She/PRP
asked/VBD Mr/NNP Cray/NNP to/TO stop/VB the/DT dog/NN He/PRP
couldn't/MD Mrs/NNP Brown/NNP planted/VBD roses/NNS and/CC
weeded/VBN the/DT garden/NN The/DT silly/JJ dog/NN %/NN dug/VBN
up/IN the/DT roses/NNS looking/VBG for/IN a/DT vole/NN on/IN
June/NNP 12,/CD 1880/CD Weeding/NNP was/VBD no/RB longer/RB
needed/VBN Prof/NNP Darren,/NNP Mrs/NNP Brown/NNP &/NN Mr/NNP
Cray/NNP met/VBD the/DT next/JJ day/NN to/TO concoct/NN a/DT
plan/NN John/NNP Darren/NNP and/CC Mrs/NNP Brown/NNP put/VBD
up/RP a/DT scarecrow/NN She/PRP thought/VBD it/PRP would/MD
scare/VB the/DT dog/NN Mr/NNP Cray/NNP put/VB up/RP a/DT
fence/NN Problem/NNP solved/VBD Then/RB Mrs/NNP Brown/NNP
planted/VBD lotus,/NN carnations,/NN daffodils,/NN and/CC


• To undo the Parts of Speech tagging, go back to Panel 2, 1. Utilities, and select the Undo
option under Parts of Speech Tagging.
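
If you want to work with tagged text outside AutoMap, the word/TAG tokens shown in the tagged text can be split into pairs with a few lines of Python. This is a minimal sketch that assumes the tagged text is plain text with whitespace-separated word/TAG tokens, as in the listing in this section; the function name parse_tagged is hypothetical, and the layout of any saved tagging output file is not confirmed here.

```python
def parse_tagged(line):
    """Split whitespace-separated 'word/TAG' tokens into (word, tag) pairs."""
    pairs = []
    for token in line.split():
        # Split at the last slash so words that contain a slash stay intact.
        word, separator, tag = token.rpartition("/")
        if separator:  # skip tokens that carry no tag
            pairs.append((word, tag))
    return pairs

sample = "brown/JJ dog/NN ate/VBD the/DT lotus/NN blossom/NN"
pairs = parse_tagged(sample)

# Penn Treebank noun tags all start with "NN" (NN, NNS, NNP, NNPS).
nouns = [word for word, tag in pairs if tag.startswith("NN")]
```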

2.  Parts  of  Speech  Tagging 

This routine associates every word, after the highest level of pre-processing applied so far,
with its part of speech.

In  order  to  tag  your  texts,  go  to  the  Utilities  Panel,  Parts  of  Speech  Tagging  field,  and 
press  the  Tag  Texts  button. 

To see the resulting tagging, go to the upper left panel and select 8. Texts after Parts of
Speech Tagging.

To  undo  the  tagging,  go  to  the  Utilities  Panel,  Parts  of  Speech  Tagging  field,  and  press 
the  Undo  Tagging  button. 

This POS tagger is based on a Hidden Markov Model. The training data
come from the Penn Treebank 3 corpus. We are grateful to Alex Rudnicky from CMU
for providing the training data to us.

Text  Pre-Processing 

1.  Introduction  to  Text  Pre-Processing  in  AutoMap 

Pre-processing  reduces  the  data  to  terms  relevant  to  you. 
Tip! All pre-processing techniques in AutoMap are optional.

There are some points to consider before you begin pre-processing:

1. Map Analysis can be run without any prior data pre-processing.

2. Meta Matrix Text Analysis and Sub Matrix Text Analysis require pre-processing.

Pre-processing  is  semi-automated  and  iterative  and  involves  several  key  processes: 

Named-Entity  Recognition 

Named-Entity Recognition is an AutoMap feature that allows you to retrieve proper
names  (e.g.  names  of  people,  organizations,  places),  numerals,  and  abbreviations  from 
texts  (Magnini,  Negri,  Prevete  &  Tanev,  2002).  The  AutoMap  Named-Entity  Recognition 
functionality  detects: 

•  Single  words  that  are  capitalized. 

o  Example:  Copenhagen. 

•  Adjacent  words  that  are  capitalized. 

o  Example:  The  New  York  City  Police  Department. 

• A string of adjacent capitalized words that may be interrupted by one
non-capitalized word. The first and the last word in this string are capitalized.

o  Example:  Canadian  Department  of  National  Defense. 
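
The three detection rules above can be sketched in a few lines of Python. This is an illustration of the rules as described, not AutoMap's implementation: the function name find_named_entities is hypothetical, and the sketch assumes that a string may contain several single non-capitalized words as long as each one sits between two capitalized words. Numerals and abbreviations are not handled.

```python
import re

def find_named_entities(text):
    """Return strings of capitalized words found in text.

    A single non-capitalized word is kept only when it sits between two
    capitalized words, so every string starts and ends with a capital.
    """
    words = re.findall(r"[A-Za-z][A-Za-z'-]*", text)
    entities, current = [], []
    for i, word in enumerate(words):
        next_is_capitalized = i + 1 < len(words) and words[i + 1][0].isupper()
        if word[0].isupper():
            current.append(word)
        elif current and next_is_capitalized:
            current.append(word)  # bridge one non-capitalized word
        else:
            if current:
                entities.append(" ".join(current))
            current = []
    if current:
        entities.append(" ".join(current))
    return entities

sample = ("a spokesman in Copenhagen called the Canadian Department "
          "of National Defense yesterday")
entities = find_named_entities(sample)
```

Because the rules rely on capitalization alone, a capitalized word at the start of a sentence would also be reported by this sketch.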

Stemming 

Stemming  detects  inflections  and  derivations  of  concepts  in  order  to  convert  each  concept 
into  the  related  morpheme  (Jurafsky  &  Martin,  p.83,  654).  AutoMap  offers  2  stemmers: 

•  The  Porter  Stemmer  (Porter,  1980).  This  stemmer  uses  the  Porter  Stemming 
algorithm.  Additionally,  it  converts  irregular  verbs  into  the  verb's  infinitive. 

o Example: The phrase "Hospitals switched to using emergency
generators" will be stemmed to "Hospit switch to be us emerg gener."

•  KSTEM  or  KROVETZ  stemmer  (Krovetz,  1995).  An  inflection  and  derivation- 
based  stemmer.  The  KSTEM  or  KROVETZ  stemmer  can  be  customized. 

Collocation Identification

A word's collocates are words appearing next to or near it.

Tip! Collocations occurring with high frequency are powerful
indicators of a pattern of meaning in a text.

Collocations are helpful for constructing thesauri in AutoMap. AutoMap can identify
collocations of size 2 (bigrams) as shown below:

• mobile phones

• shopping centre
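
Counting bigrams is straightforward to sketch in Python. The example below only counts directly adjacent word pairs in lower-cased text; the function name count_bigrams and the sample sentence are made up for illustration and say nothing about how AutoMap itself ranks or filters collocations.

```python
from collections import Counter

def count_bigrams(text):
    """Count adjacent word pairs (collocations of size 2)."""
    words = text.lower().split()
    return Counter(zip(words, words[1:]))

sample = "mobile phones and more mobile phones at the shopping centre"
bigrams = count_bigrams(sample)

# The most frequent pair is a candidate entry for a thesaurus.
most_common_pair, frequency = bigrams.most_common(1)[0]
```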


Deletion 

Deletion removes non-content-bearing conjunctions and articles from texts (Carley, 1993).
Non-content-bearing concepts to be deleted from the texts are listed in a Delete
List. When applying a Delete List, AutoMap searches the text(s) for the concepts specified in
the Delete List and deletes matches from the text(s). Example:

•  Original  input  text:  The  New  York  City  Police  Department  said  a  number  of 
people  were  trapped  in  elevators  for  awhile. 

•  Entries  in  the  Delete  List:  the,  a,  of,  were,  in,  for,  awhile. 

•  Text  after  deletion:  New  York  City  Police  Department  said  number  people 
trapped  elevators. 
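
The example above can be reproduced with a short Python sketch. It assumes that matching is case-insensitive and ignores punctuation attached to a word; both choices, and the function name apply_delete_list, are assumptions for this illustration rather than documented AutoMap behavior.

```python
import string

def apply_delete_list(text, delete_list):
    """Remove every word that appears in delete_list (case-insensitive)."""
    blocked = {entry.lower() for entry in delete_list}
    kept = []
    for token in text.split():
        # Compare without surrounding punctuation, e.g. "awhile." -> "awhile".
        bare = token.strip(string.punctuation).lower()
        if bare not in blocked:
            kept.append(token)
    return " ".join(kept)

original = ("The New York City Police Department said a number of "
            "people were trapped in elevators for awhile.")
delete_list = ["the", "a", "of", "were", "in", "for", "awhile"]
result = apply_delete_list(original, delete_list)
```

In this sketch the sentence-final period disappears together with "awhile", because the period is attached to the deleted token.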

Thesauri 

A  thesaurus  associates  concepts  with  more  abstract  concepts.  When  applying  a  thesaurus, 
AutoMap  searches  the  text  set  for  the  text-level  concepts  denoted  in  the  thesaurus  and 
translates  matches  into  the  corresponding  concept.  The  terminology  of  a  thesaurus 
depends  on  the  content  and  the  subject  of  the  data  set  (Burkart,  1997:  163;  Zuell  &  Alexa, 
2001:  313). 

Generalization Thesaurus

A generalization thesaurus is typically a two-column collection that associates
text-level concepts with higher-level concepts. The text-level concepts represent the content
of a data set, and the higher-level concepts represent the text-level concepts in a
generalized way (Burkart, 1997; Klein 1997: 256; Popping & Roberts 1997: 382).

•  Example:  Copenhagen  will  be  associated  with  the  higher-level  concept  City. 

•  Related  type  of  Analysis:  Map  Analysis. 
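
A two-column generalization thesaurus maps naturally onto a dictionary. The Python sketch below translates single-word text-level concepts into their higher-level concepts; the function name apply_thesaurus is hypothetical, and the restriction to single words matched by exact spelling is a simplifying assumption of this example.

```python
def apply_thesaurus(text, thesaurus):
    """Replace each text-level concept with its higher-level concept."""
    translated = []
    for word in text.split():
        # Words without a thesaurus entry are kept unchanged.
        translated.append(thesaurus.get(word, word))
    return " ".join(translated)

# Two columns: text-level concept -> higher-level concept.
thesaurus = {"Copenhagen": "City"}
result = apply_thesaurus("power failed in Copenhagen", thesaurus)
```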

Meta-Matrix  Thesaurus 

A  Meta-Matrix  Thesaurus  associates  text-level  concepts  with  meta-matrix  categories. 
Since  one  concept  might  need  to  be  translated  into  several  meta-matrix  categories,  a 
meta-matrix  thesaurus  can  consist  of  more  than  two  columns.  For  example,  the  concept 
"commander"  corresponds  with  the  categories  agent  and  knowledge. 

•  Example:  City  will  be  associated  with  and  translated  into  Location. 

•  Related  type  of  Analysis:  Meta-Matrix  Analysis. 

For the meta-matrix thesaurus, column headers start with concept knowledge. The order
and naming of the column headers of the meta-matrix thesaurus can be changed in the
XML file.
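
Because one concept may belong to several meta-matrix categories, a meta-matrix thesaurus can be sketched as a mapping from a concept to a list of categories. The structure and the function name categorize below are assumptions for illustration, as is the choice to skip concepts that have no entry.

```python
# Concept -> one or more meta-matrix categories (hence more than two columns).
meta_matrix_thesaurus = {
    "commander": ["agent", "knowledge"],
    "City": ["location"],
}

def categorize(concepts, thesaurus):
    """Return (concept, category) pairs; concepts without an entry are skipped."""
    pairs = []
    for concept in concepts:
        for category in thesaurus.get(concept, []):
            pairs.append((concept, category))
    return pairs

pairs = categorize(["commander", "visited", "City"], meta_matrix_thesaurus)
```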
Sub-Matrix Selection


The  Sub-Matrix  Selection  denotes  which  Meta-Matrix  Categories  should  be  retranslated 
into  concepts  used  as  input  for  the  meta-matrix  thesaurus. 

•  Example:  Location  will  be  selected  and  translated  into  Copenhagen, 
Oskarshamn  and  Ringhals,  if  those  concepts  were  associated  with  the  Meta- 
Matrix  category  "Location"  in  the  Meta-Matrix  Thesaurus. 

•  Related  type  of  Analysis:  Sub-Matrix  Analysis. 
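The relationship between a Meta-Matrix Thesaurus and a Sub-Matrix Selection can be pictured with a short Python sketch. It is only an illustration: the dictionary layout and the function name select_sub_matrix are assumptions made here, and they do not describe AutoMap's file format or internals.

```python
# Hypothetical in-memory form of a Meta-Matrix Thesaurus:
# each text-level concept maps to one or more meta-matrix categories.
meta_matrix_thesaurus = {
    "Copenhagen": ["Location"],
    "Oskarshamn": ["Location"],
    "Ringhals": ["Location"],
    "commander": ["Agent", "Knowledge"],
}


def select_sub_matrix(thesaurus, categories):
    """Return the concepts associated with any of the selected categories."""
    wanted = {category.lower() for category in categories}
    return [
        concept
        for concept, concept_categories in thesaurus.items()
        if any(c.lower() in wanted for c in concept_categories)
    ]


if __name__ == "__main__":
    print(select_sub_matrix(meta_matrix_thesaurus, ["Location"]))
```

Selecting "Location" retranslates the category into the three place names, while a concept such as "commander" is returned for either of its two categories.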

Text  Pre-Processing 

2.  Hierarchy  of  Pre-Processing  Techniques 

If  you  apply  a  pre-processing  technique  of  a  lower  order  prior  to  a  technique  of  higher 
order,  the  pre-processing  will  be  maintained  through  all  following  procedures  of  higher 
order.  You  can  un-apply  each  technique  after  applying  it,  if  needed. 

Tip!  All  pre-processing  techniques  are  optional. 

If  you  wish  to  apply  multiple  pre-processing  techniques,  do  this  in  the  following  order: 

1. Named-Entity Recognition: This is a utility that does not impact the data.

Can be used before any type of analysis is run. Can be used before or after
Stemming.

2. Collocation/bigram Identification: This is a utility that does not impact the
data. Can be used before any type of analysis is run.

3.  Stemming:  Can  be  used  before  any  type  of  analysis  is  run.  Can  be  used 
before  or  after  Named-Entity  Recognition. 

4.  Deletion:  Can  be  used  before  any  type  of  analysis  is  run. 

5.  Thesauri: 

1.  Generalization  Thesaurus 

Can  be  applied  before  Semantic  Network  Analysis  is  run.  Can  be 
applied  before  Meta-Matrix  Thesaurus  is  applied. 

2.  Meta-Matrix  Thesaurus 

Has  to  be  applied  if  Meta-Matrix  Analysis  should  be  run. 

3.  Sub-Matrix  Selection 

Can  only  be  performed  if  Meta-Matrix  Thesaurus  was  applied.  Has  to 
be  applied  if  Sub-Matrix  Analysis  should  be  run. 
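The ordering rules above can be expressed as a small checker. The following Python sketch is an illustration only: the step names and the function check_pipeline are hypothetical labels chosen here, and AutoMap itself enforces the sequence through its interface rather than through any such script.

```python
# Recommended order of pre-processing steps, as listed above.
ORDER = [
    "named_entity_recognition",
    "bigram_identification",
    "stemming",
    "deletion",
    "generalization_thesaurus",
    "meta_matrix_thesaurus",
    "sub_matrix_selection",
]
RANK = {name: position for position, name in enumerate(ORDER)}


def check_pipeline(steps):
    """Return a list of ordering problems for a planned sequence of steps."""
    problems = []
    for earlier, later in zip(steps, steps[1:]):
        # Named-Entity Recognition may run before or after Stemming.
        if {earlier, later} == {"named_entity_recognition", "stemming"}:
            continue
        if RANK[earlier] > RANK[later]:
            problems.append(f"{earlier} should come after {later}")
    # Sub-Matrix Selection can only be performed after a Meta-Matrix Thesaurus.
    if "sub_matrix_selection" in steps and "meta_matrix_thesaurus" not in steps:
        problems.append("sub_matrix_selection requires meta_matrix_thesaurus")
    return problems


if __name__ == "__main__":
    print(check_pipeline(["generalization_thesaurus", "deletion"]))
```

An empty result means the planned sequence respects the hierarchy; every step remains optional.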

The numbering of the index card tabs on P1 and P2 reflects this hierarchy in order to make
the sequence of the pre-processing steps more intuitive.

Numbering of index card tabs on P1:

1. Original Texts
2. Texts after Symbol Removal
3. Texts after Stemming
4. Texts after Deletion
5. Texts after Generalization
6. Texts after Meta-Matrix Thesaurus
7. Texts after Sub-Matrix Selection
Numbering of index card tabs on P2:

1. Concept List
2. Union Concept List
3. Pre-Processing Settings
4. Analysis Settings

Sub-tabs of Pre-Processing Settings:

1. Utilities
2. Stemming
3. Delete List
4. Generalization Thesaurus
5. Meta-Matrix Thesaurus
6. Sub-Matrix Selection

Text  Pre-Processing 

3. 1  Named-Entity  Recognition 

To create a list of all Named Entities that are contained in the data set opened, go to
Utilities (tab no. 3) in P2 and click the Create and save Named Entities List button in the
Named-Entities field. The resulting list will be automatically saved under
NamedEntities.csv in the root directory of AutoMap.

The Named-Entity Recognition interface:

[Screenshot: Utilities index card in P2, with the controls Remove Symbols, Undo Removal of Symbols, Named Entity Recognition (Language Independent) with the option "as base for Generalization Thesaurus" and the Create list button, and N-Gram Detection (Language Independent) with the option "as base for Generalization Thesaurus".]

The black ellipses in the screen shot below highlight where to find the
NamedEntities.csv file in your root directory.


2007  AutoMap  Users  Guide  -  Page  28 


3. 1. 1  Example  for  Named-Entity  Recognition 

Resulting NamedEntities.csv file after the following Small Predefined Delete List was
applied with rhetorical adjacency:

Delete List:

a, an, some, many, this, that, these, those, the, all, one, every

NamedEntities.csv:

John Darren and Mrs. Brown
Mr. Cray
He couldn't. Mrs. Brown
Prof. Darren Mrs. Brown & Mr. Cray
13-Jun
But
Mr. Cray's
Mr. Craye
Problem solved. Then Mrs. Brown
Then Mrs. Brown
12-Jun
Weeding
She
She asked Mr. Cray
Mrs. Brown


Redundant  concepts  can  be  converted  to  one  word  by  stemming.  Concepts  not  relevant  to 
the  user  can  be  eliminated  by  deletion. 

3.2  Symbol  Removal 

In the lower left panel (P2), under tab no. 3 Pre-Processing Settings, tab no. 1
Utilities offers the Symbol Removal option. This routine strips all characters that
are neither a letter nor a number. It maintains sentence marks, and it converts
question marks and exclamation marks into sentence marks. It is a more user-friendly
replacement for the delchar option on the delete list. The overall purpose of this
routine is to clean the data thoroughly in a fully automated, easy-to-use fashion.
The routine can be unapplied with the Un-Apply button, which is located close to the
Apply button.
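The effect of Symbol Removal can be approximated in a few lines of Python. This is a rough sketch, not AutoMap's implementation: the function name remove_symbols is hypothetical, the period is assumed to be the sentence mark, and removed characters are assumed to be dropped rather than replaced by a space.

```python
def remove_symbols(text):
    """Keep letters, digits, whitespace and periods; turn ? and ! into periods."""
    # Convert question marks and exclamation marks into sentence marks.
    text = text.replace("?", ".").replace("!", ".")
    # Drop every character that is neither a letter, a number,
    # whitespace, nor a sentence mark.
    kept = "".join(ch for ch in text if ch.isalnum() or ch.isspace() or ch == ".")
    # Collapse the runs of whitespace left behind by removed symbols.
    return " ".join(kept.split())


if __name__ == "__main__":
    print(remove_symbols('She yelled "You impossible dog!" The silly dog % dug up.'))
```

In this sketch an apostrophe is removed like any other symbol, so "Cray's" becomes "Crays"; whether AutoMap treats apostrophes the same way is not stated in this guide.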

Symbol  Removal 
3.21 N-gram Identification: Bigrams


To create a list of all bigrams that are contained in the data set opened, go to the Utilities,
tab no. 3, in P2 and select the Create Bigram (Correlation) List button in the N-gram
Detection field. The resulting list will be automatically saved under CorrelationList.csv in the
root directory of AutoMap.
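As a point of reference, a bigram is a pair of directly adjacent words. The Python sketch below counts such pairs in a text. It is an assumption-laden illustration: the function name bigram_counts is hypothetical, words are split on whitespace only, and the layout of AutoMap's CorrelationList.csv is not reproduced here.

```python
from collections import Counter


def bigram_counts(text):
    """Count each pair of directly adjacent words (case-insensitive)."""
    words = text.lower().split()
    # zip pairs every word with its right-hand neighbour.
    return Counter(zip(words, words[1:]))


if __name__ == "__main__":
    counts = bigram_counts("the dog saw the dog")
    for pair, frequency in counts.most_common():
        print(pair, frequency)
```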

Text  Preprocessing 

4.  Stemming 

To  stem  a  text  (or  text  set),  go  to  the  Stemming  (tab  no.  4)  in  P2. 

Porter  Stemmer: 

To apply the Porter Stemmer, select the Apply button next to Porter Stemmer and
stemming of irregular verbs for English. The stemmed text(s) will be displayed on the tab
no. 2 Stemmed Text in P1.

To unstem the texts, go to tab no. 4. Stemming in P2 and select the Un-Apply button. Tab
no. 2. Stemmed Text in P1 will be cleared.

Krovetz  Stemmer: 

For  the  Krovetz  stemmer,  several  customization  options  are  offered: 

1.  Decide  whether  capitalized  words  should  be  stemmed  or  not. 

Use  radio  buttons  in  the  interface  to  make  your  selection.  By  default, 
capitalized  words  are  stemmed. 

2. Define words that should not be modified by the stemmer.

These words are collected in a protection list, named
selfdefined_protected_concepts.txt, stored in the AutoMap root directory
under utilities\KStem. To avoid stemming certain words, put them in this list,
one word per line, without any line delimiter.

3. Define specific stems for certain words.

These words are collected in a list of pairwise associations, named
selfdefined_pairs.txt, which is stored in the AutoMap root directory under
utilities\KStem. To stem a certain word into a pre-defined term, put the pair
(first word / pre-defined stem) in the list, one pair per line, without any line
delimiter. The selfdefined_pairs.txt list that comes along with AutoMap
already contains such pairs, which handle the correct stemming of irregular
verbs in English.
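The way a protection list and a pair list interact with a stemmer can be sketched as follows. This Python example is illustrative only: toy_stemmer is a stand-in that merely strips a trailing "s" and is not the Krovetz algorithm, the function names are hypothetical, and giving the protection list precedence over the pair list is an assumption.

```python
def toy_stemmer(word):
    """Stand-in for a real stemmer: strip a trailing 's' from longer words."""
    return word[:-1] if word.endswith("s") and len(word) > 3 else word


def stem_with_overrides(word, protected, pairs, stemmer=toy_stemmer):
    """Apply user-defined exceptions before falling back to the stemmer."""
    key = word.lower()
    if key in protected:
        # Protected words are left untouched.
        return word
    if key in pairs:
        # A pre-defined stem overrides the stemmer's own result.
        return pairs[key]
    return stemmer(word)


if __name__ == "__main__":
    protected = {"roses"}     # hypothetical protection list entries
    pairs = {"ate": "eat"}    # hypothetical word / pre-defined stem pairs
    for word in ["ate", "roses", "dogs"]:
        print(word, "->", stem_with_overrides(word, protected, pairs))
```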

To  apply  the  Krovetz  stemmer: 

Select the Apply button next to K-stem. The stemmed text(s) will be displayed on tab no.
2. Stemmed Text index card in P1.

To unstem the texts, go to the tab no. 4 Stemming in P2 and select the Un-Apply button.
The tab no. 2 Stemmed Text in P1 will be cleared.

4. 1  Example  for  Stemming  (Porter) 

Stemmed text in P1 and interface of Stemming index card on P2:

[Screenshot: P1, tab no. 3 Texts after Stemming, showing text 1 of 2]

Mr. Cray's brown dog eat the lotus blossom at 10 am. Mrs.
Brown be unhappy with the dog. She yelled at it saying "You
impossibl dog!" But the dog keep eating the flow and weed. She
asked Mr. Cray to stop the dog. He couldn't. Mrs. Brown
planted ros and weeded the gard. The silly dog % dig up the
ros looking for a vol on Jun 12, 1880. Weeding be no long
needed.

Prof. Dar, Mrs. Brown & Mr. Cray meet the next day to concoct
a plan. John Dar and Mrs. Brown put up a scarecrow. She think
it be scar the dog. Mr. Craye putup a fenc. Problem solved.
Then Mrs. Brown planted lotus, carnation, daffodil, and ros.


Text  Preprocessing 

5.  Deletion 

The Delete List is not case sensitive.

You can use the predefined Delete Lists that AutoMap offers or create your own Delete
List. All lists can be edited.

5. 1  Open  a  Delete  List 

Click  the  File  menu,  select  Open  Delete  List  and  choose  one  of  the  following  options: 

•  Open  from  file:  A  file  chooser  will  appear.  Select  a  delete  list  and  hit  the  Open 
button. 

•  Open  small  predefined  Delete  List:  AutoMap's  predefined  small  delete  list  will 
be  opened. 

• Open extensive predefined Delete List: AutoMap's predefined extensive delete list
will be opened.

The black ellipses in the screen shot below show where to access the Delete List utility:

[Screenshot: File menu with the Open Delete List entry selected and its submenu: Open from file, Open small predefined Delete List, Open extensive predefined Delete List.]


The Delete List will be displayed in P2, tab no. 5. Delete List index card.

[Screenshot: interface of the Delete List index card]


The Delete List can be edited (see section 5.3 Edit a Delete List).

5.1.1 Small predefined Delete List

The Small Delete List consists of words that occur most frequently in English: a, an, and,
some, many, this, that, these, those, the, all, one, every.

The Small Delete List can be edited (see section 5.3 Edit a Delete List).

5.1.2 Extensive predefined Delete List

The Extensive Delete List is based on words occurring most frequently in English: a, an,
and, as, at, but, for, he, her, hers, him, his, i, it, its, me, mine, my, nor, of, or, our, she, so,
that, the, their, theirs, them, they, to, us, we, who, whoever, whom, whomever, will,
would, you, your, yours, yourself. As the name indicates, the Extensive Delete List
contains more words than the Small Predefined Delete List.

The Extensive Delete List can be edited (see section 5.3 Edit a Delete List).

5.2  Create  a  Delete  List 

There  are  two  ways  to  create  a  Delete  List: 

1.  Within  AutoMap: 

Go to the Delete List index card. The general structure of a Delete List is one
single concept per line. Add concepts by typing one concept per line. Hit enter
after entering a concept. Avoid empty lines. See the example for more
information.

2. Outside of AutoMap:

Use  a  text  editor  to  create  a  Delete  List.  Please  consider  these  instructions  to 
create  a  Delete  List: 

1.  The  general  structure  of  a  Delete  List  is  one  single  Concept  per  line. 

2.  Avoid  empty  lines. 

3.  The  Delete  List  is  NOT  case  sensitive. 

4.  Save  the  List. 

5.  Open  the  Delete  List  in  AutoMap. 

6.  You  can  edit  the  Delete  List  in  AutoMap  if  you  wish. 
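If the Delete List is prepared outside of AutoMap, the structural rules above (one concept per line, no empty lines) can also be enforced by a small script instead of by hand. The Python sketch below is one possible approach; the function names and the file name my_delete_list.txt are hypothetical, and lowercasing is optional because the Delete List is not case sensitive.

```python
from pathlib import Path


def normalize_delete_list(concepts):
    """Strip whitespace, drop empty entries and duplicates, keep the order."""
    seen = set()
    cleaned = []
    for concept in concepts:
        entry = concept.strip().lower()
        if entry and entry not in seen:
            seen.add(entry)
            cleaned.append(entry)
    return cleaned


def write_delete_list(path, concepts):
    """Write one concept per line, without empty lines."""
    Path(path).write_text("\n".join(normalize_delete_list(concepts)), encoding="utf-8")


if __name__ == "__main__":
    write_delete_list("my_delete_list.txt", ["The", "", "the", " an ", "from"])
    print(Path("my_delete_list.txt").read_text(encoding="utf-8"))
```

The resulting file can then be opened in AutoMap through Open Delete List, Open from file.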

5.3.  Edit  a  Delete  List 

On  the  Delete  List  index  card  you  can: 

•  Add  concepts:  Type  one  concept  per  line.  Hit  enter  after  entering  a  concept. 

• Modify concepts: Go to the line and retype the concept.

•  Drop  concepts:  Mark  the  concept  and  hit  the  delete  key. 

5.4  Apply  a  Delete  List 

If  you  wish  to  apply  a  Delete  List  and  a  Thesaurus  we  recommend  first  applying  a  Delete 
List  and  then  a  Thesaurus.  Next,  follow  these  steps: 

1.  Before  applying  a  delete  list,  an  adjacency  option  can  be  chosen  on  the  Delete 
List  index  card.  Adjacency  can  be  either  direct  (default)  or  rhetorical.  If  the 
user  does  not  change  the  adjacency  option,  AutoMap  uses  direct  adjacency 
for  deletion  and  analysis. 

2. To delete the concepts specified in the Delete List from all texts loaded, click
the Apply Delete List button on the Delete Concepts Index card.

3. See the pre-processed texts in P1, Delete List (tab no. 3).

When applying a Delete List AutoMap does three things:

1. Search the text(s) for concepts specified in the Delete List.

2. Delete matches from the text(s).

3. Display the resulting text(s) in P1, Delete List (tab no. 3).

If  direct  adjacency  was  chosen,  concepts  specified  in  the  delete  list  are  simply 
deleted  from  texts  and  concepts  left  and  right  to  deleted  concepts  will  appear 
adjacent  to  each  other  in  terms  of  visualization  and  statement  formation. 

If rhetorical adjacency was chosen, placeholders (xxx) are inserted where a concept was
deleted. The placeholders retain the original distances between the maintained concepts
for purposes of visualization and analysis.

To  apply  multiple  delete  lists  load  the  first  one  in,  apply  it,  then  load  in  the  next,  apply  it, 
and  so  on. 
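The difference between the two adjacency options can be made concrete with a short Python sketch. It is a simplified illustration under stated assumptions: the function name apply_delete_list is hypothetical, words are split on whitespace, and punctuation attached to a word is ignored when matching. AutoMap's own matching rules may differ.

```python
import string

PLACEHOLDER = "xxx"


def apply_delete_list(text, delete_list, adjacency="direct"):
    """Delete listed concepts; with rhetorical adjacency leave a placeholder."""
    if adjacency not in ("direct", "rhetorical"):
        raise ValueError("adjacency must be 'direct' or 'rhetorical'")
    # The Delete List is not case sensitive.
    targets = {concept.lower() for concept in delete_list}
    kept = []
    for token in text.split():
        core = token.strip(string.punctuation).lower()
        if core in targets:
            if adjacency == "rhetorical":
                kept.append(PLACEHOLDER)  # retain the original distance
            # direct adjacency: drop the token, neighbours move together
        else:
            kept.append(token)
    return " ".join(kept)


if __name__ == "__main__":
    sentence = "Mr. Cray's brown dog ate the lotus blossom at 10 am."
    words = ["a", "an", "the", "at"]
    print(apply_delete_list(sentence, words, "direct"))
    print(apply_delete_list(sentence, words, "rhetorical"))
```

Applying several delete lists one after another corresponds to calling the function repeatedly on its own output.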
5.4.1 Direct Adjacency


If  direct  adjacency  is  chosen,  concepts  in  the  text  that  match  concepts  specified  in  the 
delete  list  will  be  deleted  from  texts.  As  a  result  concepts  left  and  right  of  a  deleted 
concept  move  together  and  will  be  treated  as  directly  adjacent  to  each  other  for 
visualization  and  analysis. 

To  apply  direct  adjacency  check  the  radio  button  in  the  Delete  List  index  card.  Then 
apply  the  delete  list.  If  the  user  does  not  change  the  adjacency  option,  AutoMap  uses 
direct  adjacency  for  deletion  and  analysis. 

5.4.2  Rhetorical  adjacency 

If rhetorical adjacency is chosen, placeholders "xxx" are inserted where a concept was
deleted. The placeholders retain the original distances of the maintained concepts for
visualization and analysis.

To apply rhetorical adjacency, check the Rhetorical button on the Delete List tab. Then
apply the delete list.

If  the  user  does  not  change  the  adjacency  option,  AutoMap  uses  direct  adjacency  for 
deletion  and  analysis. 

5.5  Un-Apply  a  Delete  List 

To un-apply a Delete List that was applied to the data, in P2 go to the Delete List (tab no.
5) index card and select the Un-Apply button. The tab no. 3 Delete List index card on P1
will be cleared.

5.6  Save  an  applied  Delete  List 

To  save  a  Delete  List  that  you  have  applied  to  the  data,  click  the  File  menu,  select  Save 
Delete  List  as.  A  file  chooser  will  pop  up. 

5.7  Save  text(s)  after  application  of  Delete  List 

To save the text(s) after the application of the Delete List, click the File menu, select
Save Text(s) after Delete List applied. All texts are automatically saved in a folder called
"preprocessed" in the root directory of AutoMap. The filename will be
"after_DL_NameOfYourText.txt".

5.8  Examples  for  the  application  of  a  Delete  List 

Input text:

Mr. Cray's brown dog ate the lotus blossom at 10 am. Mrs. Brown was unhappy with
the dog. She yelled at it saying "You impossible dog!" But the dog kept eating the
flowers and weeds. She asked Mr. Cray to stop the dog. He couldn't. Mrs. Brown
planted roses and weeded the garden. The silly dog % dug up the roses looking for a
vole on June 12, 1880. Weeding was no longer needed. Prof. Darren, Mrs. Brown &
Mr. Cray met the next day to concoct a plan. John Darren and Mrs. Brown put up a
scarecrow. She thought it would scare the dog. Mr. Craye put up a fence. Problem
solved. Then Mrs. Brown planted lotus, carnations, daffodils, and roses.

Tool used: AutoMap's extensive Delete List:

a, an, and, as, at, but, for, he, her, hers, him, his, i, it, its, me, mine, my, nor, of, or,
our, she, so, that, the, their, theirs, them, they, to, us, we, who, whoever, whom,
whomever, will, would, you, your, yours, yourself

Setting: Direct Adjacency (default)

Resulting text:

Mr. Cray's brown dog ate lotus blossom 10 am. Mrs. Brown was unhappy with dog.
yelled saying "You impossible dog!" dog kept eating flowers weeds. asked Mr. Cray
stop dog. couldn't. Mrs. Brown planted roses weeded garden. silly dog % dug up
roses looking vole on June 12, 1880. Weeding was no longer needed. Prof. Darren,
Mrs. Brown & Mr. Cray met next day concoct plan. John Darren Mrs. Brown put up
scarecrow. thought scare dog. Mr. Craye put up fence. Problem solved. Then Mrs.
Brown planted lotus, carnations, daffodils, roses.

Setting: Rhetorical Adjacency

Resulting text:

Mr. Cray's brown dog ate xxx lotus blossom xxx 10 am. Mrs. Brown was unhappy
with xxx dog. xxx yelled xxx xxx saying "You impossible dog!" xxx xxx dog kept
eating xxx flowers xxx weeds. xxx asked Mr. Cray xxx stop xxx dog. xxx couldn't.
Mrs. Brown planted roses xxx weeded xxx garden. xxx silly dog % dug up xxx roses
looking xxx xxx vole on June 12, 1880. Weeding was no longer needed. Prof. Darren,
Mrs. Brown & Mr. Cray met xxx next day xxx concoct xxx plan. John Darren xxx
Mrs. Brown put up xxx scarecrow. xxx thought xxx xxx scare xxx dog. Mr. Craye
put up xxx fence. Problem solved. Then Mrs. Brown planted lotus, carnations,
daffodils, xxx roses.

Tool used: The following concepts were added to the Delete List. Then the Delete List
was applied again:

from, in, what, was, with

Setting: Direct Adjacency (default)

Resulting text:

Mr. Cray's brown dog ate lotus blossom 10 am. Mrs. Brown unhappy dog. yelled
saying "You impossible dog!" dog kept eating flowers weeds. asked Mr. Cray stop
dog. couldn't. Mrs. Brown planted roses weeded garden. silly dog % dug up roses
looking vole on June 12, 1880. Weeding no longer needed. Prof. Darren, Mrs. Brown
& Mr. Cray met next day concoct plan. John Darren Mrs. Brown put up scarecrow.
thought scare dog. Mr. Craye put up fence. Problem solved. Then Mrs. Brown
planted lotus, carnations, daffodils, roses.

Setting: Rhetorical Adjacency

Resulting text:

Mr. Cray's brown dog ate xxx lotus blossom xxx 10 am. Mrs. Brown xxx unhappy
xxx xxx dog. xxx yelled xxx xxx saying "You impossible dog!" xxx xxx dog kept
eating xxx flowers xxx weeds. xxx asked Mr. Cray xxx stop xxx dog. xxx couldn't.
Mrs. Brown planted roses xxx weeded xxx garden. xxx silly dog % dug up xxx roses
looking xxx xxx vole on June 12, 1880. Weeding xxx no longer needed. Prof. Darren,
Mrs. Brown & Mr. Cray met xxx next day xxx concoct xxx plan. John Darren xxx
Mrs. Brown put up xxx scarecrow. xxx thought xxx xxx scare xxx dog. Mr. Craye
put up xxx fence. Problem solved. Then Mrs. Brown planted lotus, carnations,
daffodils, xxx roses.


Text  Preprocessing 

6.  Generalization  Thesaurus 

The  Generalization  Thesaurus  is  NOT  case  sensitive. 

6. 1.  Open  a  Generalization  Thesaurus 

Click the File menu, select Open Generalization Thesaurus. A file chooser will pop up.
Double click the thesaurus you wish to open, or single click the thesaurus and then
hit the Open button. The thesaurus will be displayed on P2, tab no. 6, Generalization
Thesaurus index card.

[Screenshot: interface of the Generalization Thesaurus index card]


6.2.  Create  a  Generalization  Thesaurus 

There  are  two  ways  to  create  a  Thesaurus: 

1.  Within  AutoMap: 

Go  to  P2,  (tab  no.  6)  Generalization  Thesaurus  (see  also  the  interface  of  the 
Generalization  Thesaurus  index  card  for  an  example). 

Use  the  Text  Area  on  this  Index  card. 

Build  and  edit  a  thesaurus. 

AutoMap  supports  users  in  building  a  generalization  thesaurus  by  loading  the  union  of 
concepts  from  the  highest  level  of  pre-processing  applied  into  the  Generalization 
Thesaurus  field.  This  is  found  on  the  Generalization  Thesaurus  index  card  and  can  be 
used  upon  demand. 

Follow  these  steps  to  load  the  union  concept  list  into  the  Generalization  Thesaurus  field: 

1.  Create  or  refresh  the  Union  Concept  List. 

2.  Hit  the  Load  Union  Concept  List  button  on  the  Generalization  Thesaurus  index 
card. 

This  concept  list  loaded  into  AutoMap  can  be  refined  by  applying  Named-Entity 
Recognition  and  Deletion  prior  to  Generalization. 
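Conceptually, a union concept list is the set of all distinct concepts found across the loaded texts. The Python sketch below illustrates that idea under simple assumptions: the function name union_concept_list is hypothetical, concepts are taken to be whitespace-separated words, and case is ignored. AutoMap's own list is built from the texts at the highest level of pre-processing applied and may be tokenized differently.

```python
def union_concept_list(texts):
    """Return the sorted set of distinct concepts across all texts."""
    concepts = set()
    for text in texts:
        concepts.update(text.lower().split())
    return sorted(concepts)


if __name__ == "__main__":
    for concept in union_concept_list(["The dog ate", "the dog slept"]):
        print(concept)
```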
Here is an example for multi-step pre-processing:


To further illustrate multi-step pre-processing techniques, copy the text passages below,
then save them as TXT files named "Our Text I.txt" and "Our Text II.txt" respectively. Load these
examples into AutoMap to follow along.

Input  texts: 

Our  Text  I.txt 

Mr.  Cray's  brown  dog  ate  the  lotus  blossom  at  10  am.  Mrs.  Brown  was  unhappy  with 
the  dog.  She  yelled  at  it  saying  "You  impossible  dog!"  But  the  dog  kept  eating  the 
flowers  and  weeds.  She  asked  Mr.  Cray  to  stop  the  dog.  He  couldn't.  Mrs.  Brown 
planted  roses  and  weeded  the  garden.  The  silly  dog  %  dug  up  the  roses  looking  for  a 
vole  on  June  12,  1880.  Weeding  was  no  longer  needed.  Prof.  Darren,  Mrs.  Brown  & 
Mr.  Cray  met  the  next  day  to  concoct  a  plan.  John  Darren  and  Mrs.  Brown  put  up  a 
scarecrow.  She  thought  it  would  scare  the  dog.  Mr.  Craye  put  up  a  fence.  Problem 
solved.  Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils,  and  roses. 

Our Text II.txt:


Mr.  Cray's  brown  dog  stopped  eating  the  lotus  blossom  at  12  pm  the  next  day.  Mrs. 
Brown  was  now  happy  with  the  dog.  She  said  "You  good  dog!"  The  dog  no  longer  ate 
the  flowers  and  weeds.  Mr.  Cray  was  pleased  too.  Mrs.  Brown  watered  the  roses  and 
fertilized  the  garden  on  June  13,  1880.  Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  over 
dinner  and  discussed  how  the  plan  had  worked.  John  Darren  and  Mrs.  Brown  would 
take  down  the  scarecrow  the  following  week.  She  thought  it  was  too  scary  for  the 
dog.  Mr.  Craye  painted  his  fence.  Then  Mrs.  Brown  watered  lotus,  carnations, 
daffodils,  and  roses. 


Pre-processing technique applied, entries, and result:

1. Create Named-Entity List

Result:

John Darren and Mrs. Brown
Mr. Cray
He couldn't. Mrs. Brown
Prof. Darren Mrs. Brown & Mr. Cray
13-Jun
But
Mr. Cray's
Mr. Craye
Problem solved. Then Mrs. Brown
Then Mrs. Brown
12-Jun
Weeding
She
She asked Mr. Cray
Mrs. Brown

2. Create Named-Entity List and use it to build a generalization thesaurus.

3. Add further words that belong together to the generalization thesaurus.

Entries:

Mr. Cray's/Mr. Craye
Mr. Craye/Mr. Cray's
Prof. Darren/Prof_Darren

4. Apply generalization thesaurus (no thesaurus content only).

Result:

mr. craye brown dog ate the lotus blossom at 10 am. mrs. brown was unhappy with
the dog. she yelled at it saying "you impossible dog!" but the dog kept eating the
flowers and weeds. she asked mr. Cray to stop the dog. he couldn't. mrs. brown
planted roses and weeded the garden. the silly dog % dug up the roses looking for a
vole on June 12, 1880. weeding was no longer needed.

mrs. brown & mr. cray, mrs. brown & mr. cray met the next day to concoct a plan.
John darren and mrs. brown put up a scarecrow. she thought it would scare the dog.
mr. Cray's put up a fence. problem solved. brown planted lotus, carnations,
daffodils, and roses.

3. Deletion (rhetorical adjacency).

Entries:

a, an, some, many, this, that, these, those, the, all, one, every

Result:

Mr. Cray's brown dog ate xxx lotus blossom at 10 am. Mrs. Brown was unhappy with
xxx dog. She yelled at it saying "You impossible dog!" But xxx dog kept eating xxx
flowers and weeds. She asked Mr. Cray to stop xxx dog. He couldn't. Mrs. Brown
planted roses and weeded xxx garden. xxx silly dog % dug up xxx roses looking for
xxx vole on June 12, 1880. Weeding was no longer needed.

Prof. Darren, Mrs. Brown & Mr. Cray met xxx next day to concoct xxx plan. John
Darren and Mrs. Brown put up xxx scarecrow. She thought it would scare xxx dog.
Mr. Craye put up xxx fence. Problem solved. Then Mrs. Brown planted lotus,
carnations, daffodils, and roses.

4. Create Union Concept List.

Result:

(Coincides with Union Concept List loaded into AutoMap, see next step)

5. Load Union Concept List into AutoMap.

Result:

1880, asked, blossom, brown, but, concoct, craye, daffodils, dinner, discussed, dog,
dog!", down, dug, fence, for, his, how, it, looking, lotus, on, over, plan, pm, problem,
said, saying, scary, solved, stopped, take, up, watered, weeded, weeding, weeds, with,
worked, yelled

2. Outside of AutoMap:

Use a text editor to create a Thesaurus.

Build  and  edit  a  thesaurus. 

Save  the  Thesaurus. 

Open  the  Thesaurus  in  AutoMap. 

You  can  edit  the  Thesaurus  in  AutoMap  if  you  wish. 
6.3 Edit a Generalization Thesaurus


You  can  add,  change  or  drop  the  lines  of  a  thesaurus  on  P2,  (tab  no.  6)  Generalization 
Thesaurus. 

The  general  structure  of  a  Thesaurus  follows  the  five  points  below  (see  also  the  interface 
of  the  Generalization  Thesaurus  index  card  for  an  example): 

1.  Every  line  contains  Concept  /  Key  Concept  or  in  other  words  Old  Word/  New 
Word. 

2.  A  Concept  can  be  one  or  more  words. 

3.  A  Key  Concept  is  one  word. 

4.  Be  sure  to  separate  the  words  by  a  slash. 

5.  The  Thesaurus  is  NOT  case  sensitive. 
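The five points above translate directly into a small parser. The following Python sketch reads thesaurus lines of the form Concept/Key Concept into a lookup table. It is an illustration only: the function name parse_thesaurus and the tuple-of-words key are assumptions made here, and the line is split at the last slash because a Key Concept is a single word.

```python
def parse_thesaurus(lines):
    """Map each concept (lowercased tuple of words) to its key concept."""
    table = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines in the input
        if "/" not in line:
            raise ValueError(f"missing slash in thesaurus line: {line!r}")
        concept, key_concept = line.rsplit("/", 1)
        # The Thesaurus is not case sensitive, so concepts are lowercased.
        table[tuple(concept.lower().split())] = key_concept.strip()
    return table


if __name__ == "__main__":
    entries = ["Copenhagen/City", "Prof. Darren/Prof_Darren"]
    print(parse_thesaurus(entries))
```

A concept of more than one word, such as "Prof. Darren", becomes a two-element key, which makes multi-word matching straightforward later on.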

6.4  Apply  a  Generalization  Thesaurus 

If  you  wish  to  apply  a  Delete  List  and  a  Generalization  Thesaurus  please  be  sure  to  use 
the  Delete  List  first  and  then  the  Thesaurus.  Then  go  through  the  following  process: 

1.  Decide  if  you  want  to  use  the  Thesaurus  content  only  option  or  not.  If  you  do 
not  select  the  Thesaurus  content  only  option  this  setting  will  not  be  applied. 

2.  If  you  select  the  Thesaurus  content  only  option  you  can  choose  an  adjacency 
option. 

3.  Adjacency  can  be  either  direct  (default)  or  rhetorical. 

4.  To  apply  your  Generalization  Thesaurus  with  the  settings  you  have  specified 
click  the  Apply  Thesaurus  button  on  the  Generalization  Thesaurus  Index  card. 
AutoMap  uses  the  entries  in  the  Thesaurus  to  search  the  text(s)  for  concepts. 
If  a  match  is  found  it  will  be  translated  into  a  key  concept.  Again,  the 
Thesaurus  is  NOT  case  sensitive. 

5. See the pre-processed texts on P1, (tab no. 4) Generalization Thesaurus.
If the Thesaurus content only option and Direct Adjacency were chosen, only
key concepts are displayed and considered for analysis. If the Thesaurus
content only option and Rhetorical Adjacency were chosen, key concepts and
their original distances, which are symbolized by placeholders (xxx), are
displayed and considered for analysis.

6.4. 1  Thesaurus  content  only 

If  the  Thesaurus  content  only  option  is  chosen  AutoMap  performs  the  following  steps: 

1.  Search  the  text(s)  for  concepts  specified  in  the  thesaurus. 

2.  Translate  matches  into  key  concepts. 

3.  Maintain  only  key  concepts  in  the  pre-processed  texts.  The  rest  of  the  input 
text  is  dropped  and  will  not  be  considered  for  further  pre-processing  or 
analysis.  The  original  distances  of  the  key  concepts  will  not  be  maintained. 
However,  punctuation  marks  like  the  end  of  sentences  and  paragraphs  are 
maintained  and  considered  for  analysis. 

4.  As  a  result,  all  key  concepts  in  the  resulting  text  appear  directly  adjacent  to 
each  other. 
To select the Thesaurus content only option, check the Thesaurus content only item on P2,
(tab no. 6.) Generalization Thesaurus index card. You can now choose to either use direct
or rhetorical adjacency for the application of the Generalization Thesaurus. Then apply
the Generalization Thesaurus. To switch back to not using the Thesaurus content only
option, uncheck the Thesaurus content only item on P2, (tab no. 6.) Generalization
Thesaurus and apply the Generalization Thesaurus again.

If  the  Thesaurus  content  only  option  is  NOT  chosen  AutoMap  performs  the  following 
steps: 

1.  Search  the  text(s)  for  concepts  specified  in  the  thesaurus. 

2.  Translate  matches  into  key  concepts. 

3.  Keep  the  rest  of  the  text  as  it  is.  This  means,  all  other  concepts  in  the  text 
that  did  not  match  concepts  specified  in  the  thesaurus  will  not  be  affected  in 
any  way.  Original  distances  of  both  unaffected  concepts  and  key  concepts  will 
be  maintained.  This  rule  does  not  apply  if  a  concept  consisting  of  more  than 
one  word  was  translated  into  a  key  concept. 

By default, the Thesaurus content only item on P2, (tab no. 6) Generalization Thesaurus
index card is not checked, so AutoMap does not apply the Thesaurus content only option.
To use the option, check the Thesaurus content only item on P2, (tab no. 6)
Generalization Thesaurus and then apply the Generalization Thesaurus again.

6.4.1.1  Direct Adjacency

Direct  adjacency  means  that  original  distances  of  concepts  that  represent  the  key  concepts 
will  neither  be  visualized  nor  considered  for  analysis. 

To  choose  the  direct  adjacency  click  the  Direct  button  in  the  Adjacency  field  on  P2,  (tab 
no.  6)  Generalization  Thesaurus  index  card.  Then  apply  the  Generalization  Thesaurus.  If 
the  user  does  not  change  the  adjacency  option,  AutoMap  uses  direct  adjacency  for 
generalization  and  analysis. 

6.4.1.2  Rhetorical Adjacency

Rhetorical adjacency means that the original distances between key concepts are
considered for the analysis. These distances are visually symbolized by placeholders
(xxx). Rhetorical adjacency can only be applied if the Thesaurus content only option
was chosen.

To  choose  the  rhetorical  adjacency  click  the  Rhetorical  button  in  the  Adjacency  field  on 
P2,  Generalization  Thesaurus  (tab  no.  6).  Then  apply  the  Generalization  Thesaurus. 

If  the  user  does  not  change  the  adjacency  option,  AutoMap  uses  direct  adjacency  for 
analysis. 
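
The three behaviours described in section 6.4 (option not selected, content only with direct adjacency, content only with rhetorical adjacency) can be illustrated with a minimal Python sketch. This is not AutoMap's implementation: the function name, the regular-expression tokenizer, and the lower-casing step are assumptions made for illustration only.

```python
import re

PLACEHOLDER = "xxx"  # placeholder used for rhetorical adjacency


def apply_generalization_thesaurus(text, thesaurus, content_only=False,
                                   adjacency="direct"):
    """Hypothetical helper: thesaurus maps a concept (one or more words)
    to a key concept. Returns the pre-processed text as one string."""
    # Assumed tokenizer: words and single punctuation marks, lower-cased.
    tokens = re.findall(r"[\w']+|[^\w\s]", text.lower())
    # Try longer concepts first so multi-word entries win over single words.
    entries = sorted(
        ((c.lower().split(), k) for c, k in thesaurus.items() if c.strip()),
        key=lambda entry: -len(entry[0]),
    )
    out, i = [], 0
    while i < len(tokens):
        for words, key in entries:
            if tokens[i:i + len(words)] == words:
                out.append(key)          # translate the match into its key concept
                i += len(words)
                break
        else:
            token = tokens[i]
            is_punctuation = re.match(r"[\w']", token) is None
            if not content_only or is_punctuation:
                out.append(token)        # keep text (or punctuation) as it is
            elif adjacency == "rhetorical":
                out.append(PLACEHOLDER)  # keep the original distance
            # content only + direct adjacency: the word is dropped
            i += 1
    return " ".join(out)


thesaurus = {"yelled": "yelling", "ate": "eating",
             "John Darren": "prof_john_darren"}
text = "She yelled at it. John Darren ate roses."
print(apply_generalization_thesaurus(text, thesaurus))
print(apply_generalization_thesaurus(text, thesaurus, content_only=True))
print(apply_generalization_thesaurus(text, thesaurus, content_only=True,
                                     adjacency="rhetorical"))
```

With direct adjacency only key concepts and punctuation survive, while rhetorical adjacency replaces every dropped word with one placeholder so that distances between key concepts stay the same.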


2007  AutoMap  Users  Guide  -  Page  45 


6.5  Un-Apply  a  Generalization  Thesaurus 


To un-apply a Generalization Thesaurus that was applied to the data, go to P2,
Generalization Thesaurus (tab no. 6) and hit the Un-Apply button. The Generalization
Thesaurus (tab no. 4) on P1 will be cleared.

6.6  Save  an  applied  Generalization  Thesaurus 

To  save  a  Generalization  Thesaurus  that  you  have  applied  to  the  data,  click  the  File 
menu,  select  Save  Generalization  Thesaurus  As  (a  file  chooser  will  pop  up). 

6.7  Save  text(s)  after  application  of  Generalization  Thesaurus 

To  save  the  text(s)  after  the  application  of  the  Generalization  Thesaurus,  click  the  File 
menu,  select  Save  Text(s)  after  Generalization  Thesaurus  applied.  All  texts  are 
automatically  saved  in  a  folder  called  "preprocessed"  in  the  root  directory  of  AutoMap. 
The filename will be "after_general_thes_NameOfYourText.txt".

6.8  Example for building and applying a Generalization Thesaurus


Applying a thesaurus to text that was not pre-processed

Input text:

Mr. Cray's brown dog ate the lotus blossom at 10 am. Mrs. Brown was unhappy with
the dog. She yelled at it saying "You impossible dog!" But the dog kept eating the
flowers and weeds. She asked Mr. Cray to stop the dog. He couldn't. Mrs. Brown
planted roses and weeded the garden. The silly dog % dug up the roses looking for a
vole on June 12, 1880. Weeding was no longer needed.

Prof. Darren, Mrs. Brown & Mr. Cray met the next day to concoct a plan. John Darren
and Mrs. Brown put up a scarecrow. She thought it would scare the dog. Mr. Craye
put up a fence. Problem solved. Then Mrs. Brown planted lotus, carnations, daffodils,
and roses.

Tool used (Generalization Thesaurus):

Mr Craye/Mr_Cray
Mrs Browns/Mrs_Brown
John Darren/Prof_John_Darren
Prof Darren/Prof_John_Darren
yelled/yelling
ate/eating

Setting: Not selected - Thesaurus content only (default)

Resulting text:

mr. cray's brown dog eating the lotus blossom at 10 am. mrs. brown was unhappy with
the dog. she yelling at it saying "you impossible dog!" but the dog kept eating the
flowers and weeds. she asked mr. cray to stop the dog. he couldn't. mrs. brown planted
roses and weeded the garden. the silly dog % dug up the roses looking for a vole on
june 12, 1880. weeding was no longer needed. prof. darren, mrs. brown & mr. cray
met the next day to concoct a plan. prof_john_darren and mrs. brown put up a [...]
daffodils, and roses.

Setting: Selected - Thesaurus content only, Direct Adjacency (default if Thesaurus
content only is chosen)

Resulting text:

eating... yelling... prof_john_darren

Setting: Selected - Thesaurus content only, Rhetorical Adjacency

Resulting text:

xxx. xxx xxx xxx eating xxx
xxx xxx xxx xxx xxx. xxx. xxx
xxx xxx xxx xxx xxx. xxx
yelling xxx xxx xxx xxx xxx
xxx xxx xxx xxx xxx xxx xxx
xxx xxx xxx. xxx xxx xxx. xxx
xxx xxx xxx xxx. xxx xxx. xxx.
xxx xxx xxx xxx xxx xxx xxx
xxx xxx xxx xxx xxx xxx, xxx.
xxx xxx xxx xxx xxx.

xxx. xxx, xxx. xxx xxx. xxx
xxx xxx xxx xxx xxx xxx xxx.
prof_john_darren xxx xxx. xxx
xxx xxx xxx. xxx xxx xxx xxx
xxx xxx xxx. xxx. xxx xxx xxx.
xxx xxx. xxx xxx. xxx xxx xxx,
xxx, xxx, xxx xxx.

Applying a thesaurus to text that was pre-processed with a
Delete List, direct adjacency:

Input text:

Mr. Cray's brown dog ate lotus blossom 10 am. Mrs. Brown was unhappy with dog.
yelled saying "You impossible dog!" dog kept eating flowers weeds. asked Mr. Cray
stop dog. couldn't. Mrs. Brown planted roses weeded garden. silly dog % dug up
roses looking vole on June 12, 1880. Weeding was no longer needed.

Prof. Darren, Mrs. Brown & Mr. Cray met next day concoct plan. John Darren Mrs.
Brown put up scarecrow. thought scare dog. Mr. Craye put up fence. Problem solved.
Then Mrs. Brown planted lotus, carnations, daffodils, roses.

Tool used: Thesaurus (same as above)

Setting: Not selected - Thesaurus content only (default)

Resulting text:

mr. cray's brown dog eating lotus blossom 10 am. mrs. brown was unhappy with dog.
yelling saying "you impossible dog!" dog kept eating flowers weeds. asked mr. cray
stop dog. couldn't. mrs. brown planted roses weeded garden. silly dog % dug up roses
looking vole on june 12, 1880. weeding was no longer needed. prof. darren, mrs.
brown & mr. cray met next day concoct plan. prof_john_darren mrs. brown put up
scarecrow. thought scare dog. mr. craye put up fence. problem solved. then mrs.
brown planted lotus, carnations, daffodils, roses.

Setting: Selected - Thesaurus content only, Direct Adjacency (default if Thesaurus
content only is chosen)

Resulting text:

eating... yelling... prof_john_darren...

Setting: Selected - Thesaurus content only, Rhetorical Adjacency

Resulting text:

xxx. xxx xxx xxx eating xxx xxx
xxx xxx. xxx. xxx xxx xxx xxx
xxx. yelling xxx xxx xxx xxx xxx
xxx xxx xxx xxx. xxx xxx. xxx
xxx xxx. xxx. xxx. xxx xxx xxx
xxx xxx. xxx xxx xxx xxx xxx xxx
xxx xxx xxx xxx, xxx. xxx xxx
xxx xxx xxx.

xxx. xxx, xxx. xxx xxx. xxx xxx
xxx xxx xxx xxx.
prof_john_darren xxx. xxx xxx
xxx xxx. xxx xxx xxx. xxx. xxx
xxx xxx. xxx xxx. xxx xxx. xxx
xxx xxx, xxx, xxx, xxx.


Applying a thesaurus to text that was pre-processed with a
Delete List, rhetorical adjacency:

Input text (text after application of the customized extensive Delete List,
rhetorical adjacency):

Mr. Cray's brown dog ate xxx lotus blossom xxx 10 am. Mrs. Brown was unhappy with
xxx dog. xxx yelled xxx xxx saying "You impossible dog!" xxx xxx dog kept eating xxx
flowers xxx weeds. xxx asked Mr. Cray xxx stop xxx dog. xxx couldn't. Mrs. Brown
planted roses xxx weeded xxx garden. xxx silly dog % dug up xxx roses looking xxx
xxx vole on June 12, 1880. Weeding was no longer needed.

Prof. Darren, Mrs. Brown & Mr. Cray met xxx next day xxx concoct xxx plan. John
Darren xxx Mrs. Brown put up xxx scarecrow. xxx thought xxx xxx scare xxx dog. Mr.
Craye put up xxx fence. Problem solved. Then Mrs. Brown planted lotus, carnations,
daffodils, xxx roses.

Tool used: Thesaurus (same as above)

Setting: Not selected - Thesaurus content only (default)

Resulting text:

mr. cray's brown dog eating xxx lotus blossom xxx 10 am. mrs. brown was unhappy
with xxx dog. xxx yelling xxx xxx saying "you impossible dog!" xxx xxx dog kept
eating xxx flowers xxx weeds. xxx asked mr. cray xxx stop xxx dog. xxx couldn't.
mrs. brown planted roses xxx weeded xxx garden. xxx silly dog % dug up xxx roses
looking xxx xxx vole on june 12, 1880. weeding was no longer needed.

prof. darren, mrs. brown & mr. cray met xxx next day xxx concoct xxx plan.
prof_john_darren xxx mrs. brown put up xxx scarecrow. xxx thought xxx xxx scare
xxx dog. mr. craye put up xxx fence. problem solved. then mrs. brown planted lotus,
carnations, daffodils, xxx roses.

Setting: Selected - Thesaurus content only, Direct Adjacency (default if Thesaurus
content only is chosen)

Resulting text:

eating... yelling... prof_john_darren...

Setting: Selected - Thesaurus content only, Rhetorical Adjacency

Resulting text:

xxx. xxx xxx xxx eating xxx xxx
xxx xxx xxx xxx. xxx. xxx xxx
xxx xxx xxx xxx. xxx yelling xxx
xxx xxx xxx xxx xxx xxx xxx xxx
xxx xxx xxx xxx xxx xxx. xxx xxx
xxx. xxx xxx xxx xxx xxx. xxx
xxx. xxx. xxx xxx xxx xxx xxx
xxx xxx. xxx xxx xxx xxx xxx xxx
xxx xxx xxx xxx xxx xxx xxx xxx,
xxx. xxx xxx xxx xxx xxx.

xxx. xxx, xxx. xxx xxx. xxx xxx
xxx xxx xxx xxx xxx xxx xxx.
prof_john_darren xxx xxx. xxx
xxx xxx xxx xxx. xxx xxx xxx xxx
xxx xxx xxx. xxx. xxx xxx xxx
xxx. xxx xxx. xxx xxx. xxx xxx
xxx, xxx, xxx, xxx xxx.


Text  Preprocessing 
7.  Meta-Matrix  Thesaurus 

A  Meta-Matrix  Thesaurus  has  to  be  applied  if  Meta-Matrix  Analysis  should  be 
performed. 

A  Meta-Matrix  Thesaurus  associates  concepts  with  meta-matrix  categories: 

•  Agent 

•  Knowledge 

•  Resource 

•  Task/Event 

•  Organization 

•  Location 

•  Action 

•  Role 

•  Attribute 

•  Any  user-defined  category  (as  many  as  the  user  defines) 

When applying a Meta-Matrix Thesaurus, AutoMap searches the text(s) for the entries
specified in the Meta-Matrix Thesaurus and translates matches into the related Meta-Matrix
categories. If you also want to apply a Delete List and/or a Generalization Thesaurus,
you need to apply these pre-processing tools before the Meta-Matrix Thesaurus. The
Meta-Matrix Thesaurus is NOT case sensitive.

See also the description of the meta-matrix model as implemented in AutoMap to better
understand the meta-matrix.

7.1  Open a Meta-Matrix Thesaurus

Click  the  File  menu,  select  Open  Meta-Matrix  Thesaurus  and  choose  Open  from  highest 
level  of  pre-processing. 

The  union  of  concepts  after  the  highest  level  of  pre-processing  applied  so  far  will  be 
displayed  in  alphabetical  order  on  P2,  Meta-Matrix  Thesaurus  (tab  no.  7)  index  card. 

Note:  The  Meta-Matrix  Thesaurus  can  be  edited. 

If  you  have  a  pre-defined  Meta-Matrix  Thesaurus  available  that  matches  (some  of)  the 
concepts  contained  in  the  loaded  Meta-Matrix  Thesaurus  you  can  open  this  file. 

To  do  so,  click  the  File  menu,  select  Open  Meta-Matrix  Thesaurus  and  choose  Open  from 
file. 

If a concept contained in the pre-defined file matches a concept in the currently opened
Meta-Matrix Thesaurus, the meta-matrix categories assigned to this concept in the
pre-defined file will be automatically assigned to the concept in the currently opened
Meta-Matrix Thesaurus.

Note:  The  pre-assigned  Meta-Matrix  Thesaurus  can  be  edited. 
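
The matching rule described above can be sketched as follows. This is an illustration under assumptions, not AutoMap's code: the function name and the dictionary representation of the thesaurus are hypothetical, and matching is done case-insensitively because the guide states that the Meta-Matrix Thesaurus is not case sensitive.

```python
def merge_predefined(opened_concepts, predefined):
    """Hypothetical helper.

    opened_concepts: concepts currently shown in the Concept column.
    predefined: dict mapping a concept to the list of categories
                assigned to it in the pre-defined file.
    Returns a dict concept -> categories, in alphabetical order.
    """
    lookup = {concept.lower(): cats for concept, cats in predefined.items()}
    merged = {}
    for concept in sorted(opened_concepts, key=str.lower):
        # Concepts without a match keep an empty assignment; entries of the
        # pre-defined file that do not occur in the opened list are ignored.
        merged[concept] = list(lookup.get(concept.lower(), []))
    return merged


opened = ["dog", "fence", "Eating"]
predefined = {"dog": ["agent"], "eating": ["task"], "lotus": ["resource"]}
print(merge_predefined(opened, predefined))
```

The result can still be edited afterwards, which corresponds to editing the pre-assigned thesaurus on the index card.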

7.2  Edit  a  Meta-Matrix  Thesaurus 

To  each  concept  that  appears  in  the  Concept  column  of  the  Meta-Matrix  Thesaurus  you 
can  assign  special  categories: 

•  One  or  multiple  meta-matrix  categories  to  a  concept  by  checking  these 
categories.  To  unselect  an  assignment,  uncheck  the  meta-matrix  categories. 

•  Self-defined meta-matrix categories (e.g. location for Copenhagen). To do
this, enter the category you wish to define in the last column (User-defined)
of the Meta-Matrix Thesaurus. You can define as many categories as you want.
You can assign a concept to multiple self-defined categories; in this case,
you need a separate row for each assignment. Hit Enter when you have
finished a self-defined row. To unselect an assignment, uncheck the
meta-matrix categories.

•  No  meta-matrix  category  to  a  concept.  In  this  case,  this  concept  will  not  be 
considered  for  meta-matrix  pre-processing  and  Meta-Matrix  Analysis. 

7.3  Build a Meta-Matrix Thesaurus

You can build a Meta-Matrix Thesaurus outside of AutoMap by using a text editor.

Please consider these instructions:

1.  Every line contains (Key) Concept / Meta-Matrix category / Meta-Matrix
category/.... You can assign a (key) concept to as many Meta-Matrix
categories as you want. If a concept in the Meta-Matrix Thesaurus you wrote
does not appear in the Concepts column of the Meta-Matrix Thesaurus index
card, this does not cause a conflict. AutoMap searches the concepts contained
in the Concepts column, and when it finds a concept that also appears in your
manually built Thesaurus, it assigns the categories you gave this concept to
the concept on the index card.

2.  Avoid  empty  lines. 

3.  Make  sure  to  separate  the  words  by  a  slash. 

4.  The  Meta-Matrix  Thesaurus  is  NOT  case  sensitive. 

5.  Save  the  Meta-Matrix  Thesaurus. 

6.  Open  the  Meta-Matrix  Thesaurus  in  AutoMap. 

7.  You  can  edit  the  Meta-Matrix  Thesaurus  in  AutoMap  if  you  wish. 
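
As an alternative to typing the file by hand, the line format from step 1 can be generated and checked with a short script. The sketch below is only an illustration: the helper names are hypothetical, and the reader simply merges repeated lines for the same concept, which is an assumption about how such lines should be combined.

```python
def format_thesaurus(assignments):
    """Hypothetical helper: assignments maps a concept to a list of
    categories. Returns the file content, one
    'concept/category/category' line per concept, without empty lines."""
    lines = []
    for concept, categories in assignments.items():
        if not categories:
            continue  # concepts without a category are left out
        lines.append("/".join([concept, *categories]))
    return "".join(line + "\n" for line in lines)


def parse_thesaurus(text):
    """Read the slash-separated format back into a dict.
    Concepts and categories are lower-cased because the
    Meta-Matrix Thesaurus is not case sensitive."""
    result = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # tolerate, but do not produce, empty lines
        concept, *categories = [part.strip() for part in line.split("/")]
        result.setdefault(concept.lower(), []).extend(
            category.lower() for category in categories)
    return result


content = format_thesaurus({
    "Mr_Cray": ["agent"],
    "weeds": ["resource", "task"],
    "mobile": [],            # no category assigned: not written
})
print(content, end="")
print(parse_thesaurus(content))
```

The returned string can be saved with any text editor or with `pathlib.Path("my_thesaurus.txt").write_text(content)`; the file name here is only an example.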

7.4  Apply a Meta-Matrix Thesaurus

Meta-matrix pre-processing is a higher level of pre-processing than the application of a
Delete List and a Generalization Thesaurus. Thus, if you also want to apply a Delete List
and/or a Generalization Thesaurus, you need to apply these pre-processing tools
before the Meta-Matrix Thesaurus.

Follow  this  process: 

1.  Decide  if  you  want  to  use  the  Thesaurus  content  only  option  or  not. 

If  you  do  not  select  the  Thesaurus  content  only  option  this  setting  will  not  be 
applied. 

2.  If  you  select  the  Thesaurus  content  only  option  you  can  choose  an  adjacency 
option. 

3.  Adjacency  can  be  either  direct  (default)  or  rhetorical. 

If you do not change the adjacency option, AutoMap uses direct adjacency.

4.  To  apply  your  Meta-Matrix  Thesaurus  with  the  settings  you  have  specified 
click  the  Apply  Thesaurus  button  on  the  Meta-Matrix  Thesaurus  Index  card. 
AutoMap  uses  the  entries  in  the  Thesaurus  to  search  the  text(s)  for  concepts. 
If  a  match  is  found  it  will  be  translated  into  a  Meta-Matrix  category.  The 
Thesaurus  is  case  insensitive. 

5.  See the pre-processed texts on P1, Meta-Matrix Thesaurus (tab no. 5).
Meta-Matrix categories are enclosed in tags (e.g., <agent>). If multiple
Meta-Matrix categories were assigned to a concept, these categories appear
in one tag (e.g., if police was assigned to agent and resource, then the
translated text would look like this: <agent, resource>).
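
The tag format from step 5 can be illustrated with a small sketch. It assumes single-word concepts, a simple regular-expression tokenizer, and lower-casing of the text; the function names are hypothetical and the code does not reproduce AutoMap's internals.

```python
import re


def tag_for(categories):
    """Build one tag for all categories assigned to a concept."""
    return "<" + ", ".join(categories) + ">"


def apply_meta_matrix(text, thesaurus):
    """Hypothetical helper: thesaurus maps a lower-cased concept to its
    list of Meta-Matrix categories. Text that does not match is kept
    (Thesaurus content only option not selected)."""
    tokens = re.findall(r"[\w']+|[^\w\s]", text.lower())
    return " ".join(
        tag_for(thesaurus[token]) if token in thesaurus else token
        for token in tokens
    )


thesaurus = {"police": ["agent", "resource"], "dog": ["agent"]}
print(apply_meta_matrix("The police stopped the dog.", thesaurus))
```

A concept with two categories yields a single combined tag rather than two separate tags, as described in step 5.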

7.4.1  Thesaurus content only

If  the  Thesaurus  content  only  option  is  chosen  AutoMap  does  the  following: 

1.  Search  the  text(s)  for  concepts  specified  in  the  Concept  column  of  the  Meta- 
Matrix  Thesaurus. 

2.  Translate  matches  into  related  meta-matrix  categories. 

3.  Maintain only meta-matrix categories in the pre-processed texts. The rest of
the text is dropped and will not be considered for further pre-processing or
analysis. Punctuation marks like the end of sentences and paragraphs are
maintained and considered for analysis.

To select the Thesaurus content only option, check the Thesaurus content only item on P2,
Meta-Matrix Thesaurus (tab no. 7). You can then choose either direct (default) or
rhetorical adjacency. Then apply the Meta-Matrix Thesaurus. To stop using the
Thesaurus content only option, uncheck the Thesaurus content only item and
apply the Thesaurus again.

If  the  Thesaurus  content  only  option  is  NOT  chosen  AutoMap  does  the  following: 

1.  Search  the  text(s)  for  concepts  specified  in  the  thesaurus. 

2.  Translate matches into meta-matrix categories.

3.  Keep the rest of the text as it is. This means all other concepts in the text
that did not match concepts specified in the thesaurus will not be affected at
all. Original distances of both unaffected concepts and meta-matrix categories
will be maintained. This rule does not apply if a concept consisting of more
than one word is translated into a meta-matrix category.

AutoMap  by  default  does  not  select  the  Thesaurus  content  only  option.  Therefore,  the 
Thesaurus  content  only  item  on  P2,  Meta-Matrix  Thesaurus  (tab  no.  7)  by  default  is  not 
checked.  Just  apply  the  Meta-Matrix  Thesaurus.  In  order  to  switch  to  using  the  Thesaurus 
content  only  option  check  the  Thesaurus  content  only  item  on  P2,  Meta-Matrix  Thesaurus 
(tab  no.  7)  index  card  and  then  apply  the  Thesaurus  again. 

7.4.1.1  Direct Adjacency

Direct  adjacency  means  that  original  distances  of  concepts  that  represent  the  key  concepts 
will  neither  be  visualized  nor  considered  for  analysis. 

To choose direct adjacency, click the Direct button in the Adjacency field on P2,
Meta-Matrix Thesaurus index card (tab no. 7). Then apply the Meta-Matrix Thesaurus. If
the user does not change the adjacency option, AutoMap uses direct adjacency for
analysis. As a result, only meta-matrix categories are displayed on P1, Meta-Matrix
Thesaurus (tab no. 5) and considered for analysis. All meta-matrix tags in the resulting
text appear directly adjacent to each other.

7.4.1.2  Rhetorical Adjacency

Rhetorical adjacency can only be applied if the Thesaurus content only option was
chosen.

To  choose  the  rhetorical  adjacency  click  the  Rhetorical  button  in  the  Adjacency  field  on 
P2,  (tab  no.  7)  Meta-Matrix  Thesaurus.  Then  apply  the  Meta-Matrix  Thesaurus.  If  the 
user  does  not  change  the  adjacency  option,  AutoMap  uses  direct  adjacency  for  analysis. 
As a result, the meta-matrix tags and the rest of the text are displayed on P1, (tab no. 5)
Meta-Matrix Thesaurus and will be considered for analysis. Original distances of the
meta-matrix tags that represent the key concepts will be visually symbolized by
placeholders (xxx) and considered for analysis.

7.5  Un-Apply  a  Meta-Matrix  Thesaurus 

To un-apply a Meta-Matrix Thesaurus that was applied to the data, go to P2, (tab no. 7)
Meta-Matrix Thesaurus and hit the Un-Apply button. Tab no. 5, Meta-Matrix
Thesaurus, on P1 will be cleared.

7.6  Save  an  applied  Meta-Matrix  Thesaurus 

If  you  wish  to  save  a  Meta-Matrix  Thesaurus  you  first  need  to  apply  it.  To  save  the 
Thesaurus,  click  the  File  menu,  select  Save  Meta-Matrix  Thesaurus  as.  A  file  chooser 
will  pop  up. 

7.7  Save  text(s)  after  application  of  Meta-Matrix  Thesaurus 

To  save  the  text(s)  after  the  application  of  the  Meta-Matrix  Thesaurus,  click  the  File 
menu,  select  Save  Text(s)  after  Meta-Matrix  Thesaurus  applied.  All  texts  are 
automatically  saved  in  a  folder  called  "preprocessed"  in  the  root  directory  of  AutoMap. 
The filename will be "after_MMCatThes_NameOfYourText.txt".

7.8  Example  for  editing  and  applying  a  Meta-Matrix  Thesaurus 

An  Extract  from  the  Our  Text  I.txt  was  used  as  input: 

Mr.  Cray's  brown  dog  ate  the  lotus  blossom  at  10  am.  Mrs.  Brown  was  unhappy  with 
the  dog.  She  yelled  at  it  saying  "You  impossible  dog!"  But  the  dog  kept  eating  the 
flowers  and  weeds.  She  asked  Mr.  Cray  to  stop  the  dog.  He  couldn't.  Mrs.  Brown 
planted  roses  and  weeded  the  garden.  The  silly  dog  %  dug  up  the  roses  looking  for  a 
vole  on  June  12,  1880.  Weeding  was  no  longer  needed. 

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  the  next  day  to  concoct  a  plan.  John  Darren 
and  Mrs.  Brown  put  up  a  scarecrow.  She  thought  it  would  scare  the  dog.  Mr.  Craye 
put  up  a  fence.  Problem  solved.  Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils, 
and  roses. 

The  customized  extensive  Delete  List  was  applied  to  this  text.  The  resulting  text  looks 
like  this: 

Mr. Cray's brown dog ate lotus blossom 10 am. Mrs. Brown was unhappy with dog.
yelled saying "You impossible dog!" dog kept eating flowers weeds. asked Mr. Cray
stop dog. couldn't. Mrs. Brown planted roses weeded garden. silly dog % dug up
roses looking vole on June 12, 1880. Weeding was no longer needed.

Prof. Darren, Mrs. Brown & Mr. Cray met next day concoct plan. John Darren Mrs.
Brown put up scarecrow. thought scare dog. Mr. Craye put up fence. Problem solved.
Then Mrs. Brown planted lotus, carnations, daffodils, roses.

Then, open the Meta-Matrix Thesaurus by clicking the File menu, selecting Open
Meta-Matrix Thesaurus, and choosing Open from highest level of pre-processing. The black
ellipse in the screen shot below highlights how to open a Meta-Matrix Thesaurus from
the File menu.


[Screenshot: AutoMap File menu]


The  union  of  concepts  from  the  highest  level  of  pre-processing  will  be  displayed  in 
alphabetical  order  on  P2,  tab  no.  3  -  Preprocessing  Settings,  tab  no.  5  -  Meta-Matrix 
Thesaurus.  As  we  did  not  pre-process  the  text  the  original  input  sentence  is  used  for  input 
for  the  Meta-Matrix  Thesaurus. 

[Screenshot: P2, 3. Pre-Processing Settings, 5. Meta-Matrix Thesaurus index card]

Furthermore,  we  have  prepared  a  Meta-Matrix  Thesaurus  that  we  stored  on  our  machine. 
This  file  looks  like  this: 


•  Mr_Cray/agent 

•  Mrs_Brown/agent 

•  Prof_John_Darren/agent 

•  dog/agent 

•  flowers/resource 

•  lotus/resource 

•  roses/resource 

•  carnations/resource 

•  daffodils/resource 

•  weeds/resource 

•  weeds/task 

•  planting/task 

•  eating/task 

•  yelling/task 

•  met/task 

We  clicked  the  File  menu,  selected  Open  Meta-Matrix  Thesaurus  and  chose  Open  from 
file. 


AutoMap searched the opened Meta-Matrix Thesaurus for the words contained in the
prepared Thesaurus. When it found a match, it assigned the words in the opened
Thesaurus the Meta-Matrix categories that were assigned to the same concept in the
pre-defined file. Below is the result:

[Screenshot: Meta-Matrix Thesaurus index card with the pre-assigned categories]


Now we edit the Thesaurus by modifying some of the pre-assignments (e.g., centre) and
adding assignments for concepts not assigned to Meta-Matrix categories yet (e.g.,
contact, Copenhagen). Not all concepts were associated with Meta-Matrix categories
(e.g., mobile).


[Screenshot: Meta-Matrix Thesaurus index card after editing]

In the next step we applied the Meta-Matrix Thesaurus with the following settings:


Setting: Not Thesaurus content only (default)

Resulting text:

mr . cray's brown <agent> <task> the <resource> blossom
at 10 am . mrs . brown was unhappy with the <agent> . she
<task> at it saying "you impossible dog!" but the <agent>
kept <task> the <resource> and <task> . she asked mr .
cray to stop the <agent> . he couldn't . mrs . brown planted
<resource> and weeded the garden . the silly <agent> % dug
up the <resource> looking for a vole on june 12 , 1880 .
weeding was no longer needed . prof . darren , mrs . brown &
mr . cray <task> the next day to concoct a plan . <agent>
and mrs . brown put up a scarecrow . she thought it would
scare the <agent> . mr . craye put up a fence . problem
solved . then mrs . brown planted <resource> , <resource> ,
<resource> , and <resource> .

Setting: Thesaurus content only, Direct Adjacency (default if Thesaurus content only
is chosen)

Resulting text:

. <agent> <task> <resource> . . <agent> . <task>
<agent> <task> <resource> <task> . . <agent> . . .
<resource> . <agent> <resource> . <task> . <agent>
. . <agent> . <resource> <resource> <resource>
<resource> .

Setting: Thesaurus content only, Rhetorical Adjacency

Resulting text:

xxx . xxx xxx <agent> <task> xxx <resource> xxx xxx xxx
xxx . xxx . xxx xxx xxx xxx xxx <agent> . xxx <task> xxx
xxx xxx xxx xxx xxx xxx xxx xxx <agent> xxx <task> xxx
<resource> xxx <task> . xxx xxx xxx . xxx xxx xxx xxx
<agent> . xxx xxx . xxx . xxx xxx <resource> xxx xxx xxx xxx
. xxx xxx <agent> xxx xxx xxx xxx <resource> xxx xxx xxx
xxx xxx xxx xxx xxx . xxx xxx xxx xxx xxx . xxx xxx xxx . xxx
xxx . xxx xxx xxx . xxx <task> xxx xxx xxx xxx xxx xxx xxx .
<agent> xxx xxx . xxx xxx xxx xxx xxx . xxx xxx xxx xxx xxx
xxx <agent> . xxx . xxx xxx xxx xxx . xxx xxx . xxx xxx . xxx
xxx <resource> <resource> <resource> xxx <resource> .


Text  Preprocessing 

8.  Sub-Matrix  Selection 

The Sub-Matrix Selection enables the user to re-translate concepts represented by a
Meta-Matrix category in order to run Sub-Matrix Analysis. If input texts (no matter if they
were pre-processed with a Delete List or not) were used in order to generate the Concept
List for the Meta-Matrix Thesaurus, concepts represented by a Meta-Matrix category will
be translated into text-level concepts. If input texts (no matter if they were pre-processed
with a Delete List or not) were pre-processed with a Generalization Thesaurus before
applying the Meta-Matrix Thesaurus, concepts represented by a Meta-Matrix category
will be translated into key concepts.

The  Thesaurus  content  only  option  always  automatically  applies  for  the  Sub-Matrix 
Selection. 

8.1  Select Sub-Matrix Categories

Precondition: Sub-Matrix Selection can only be performed if a Meta-Matrix
Thesaurus was applied. There are 4 ways to select sub matrices:

8.1.1  If  you  do  not  have  a  sub  matrix  selection  file  available 

Create  and  modify  a  sub  matrix  selection. 

8.1.2  If you do not have a sub matrix selection file available and
want to select the full meta matrix (that is, all cells in the meta
matrix)

Go to the File menu, select Open Sub-Matrix Selection, and click on Select Full Meta Matrix.
AutoMap dynamically generates all combinations of meta matrix categories as specified
in the meta matrix thesaurus, including user-defined categories, that represent all cells of
the meta matrix and displays these combinations in the left window on P2, 8. Sub-Matrix
Selection.

You  can  modify  this  sub  matrix  selection. 

This is the full meta-matrix:

•  agent/agent/knowledge/organization/task- 
event/resource/location/role/action/attribute 

•  knowledge/agent/knowledge/organization/task- 
event/resource/location/role/action/attribute 

•  organization/agent/knowledge/organization/task- 
event/resource/location/role/action/attribute 

•  task-event/agent/knowledge/organization/task-
event/resource/location/role/action/attribute

•  resource/agent/knowledge/organization/task- 
event/resource/location/role/action/attribute 

•  location/agent/knowledge/organization/task- 
event/resource/location/role/action/attribute 

•  role/agent/knowledge/organization/task- 
event/resource/location/role/action/attribute 

•  action/agent/knowledge/organization/task- 
event/resource/location/role/action/attribute 

•  attribute/agent/knowledge/organization/task- 
event/resource/location/role/action/attribute 
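
The rows listed above follow a regular pattern: each category, followed by every category. The minimal sketch below reproduces that pattern in Python. The function name `full_meta_matrix_rows` is hypothetical and not part of AutoMap; the category names are copied from the listing above, and user-defined categories would have to be added by hand.

```python
# Categories as they appear in the full meta-matrix listing above.
CATEGORIES = [
    "agent", "knowledge", "organization", "task-event", "resource",
    "location", "role", "action", "attribute",
]


def full_meta_matrix_rows(categories):
    """Return one selection row per category: the category followed by all categories."""
    all_targets = "/".join(categories)
    return [f"{source}/{all_targets}" for source in categories]


rows = full_meta_matrix_rows(CATEGORIES)
for row in rows:
    print(row)
```

Each printed row corresponds to one bullet of the listing, so nine categories give nine rows.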

8.1.3  If  you  have  a  sub  matrix  selection  file  available 

Go to the File menu, select Open Sub-Matrix Selection, and click on Open from file.

You can modify this sub matrix selection.


8.1.4  If  you  want  to  write  your  own  sub  matrix  selection  file  and 
load  it  into  AutoMap: 

Build  your  own  sub  matrix  selection  file  outside  of  AutoMap,  save  it  in  .txt  format,  and 
load  it  into  AutoMap. 

Go to the File menu, select Open Sub-Matrix Selection, and click on Open from file. You can modify this sub matrix selection.

8.2  Create  or  Modify  Sub-Matrix  Selection 

Go to P2, 8. Sub-Matrix Selection, click on a Sub-Matrix category you wish to select, and hit the > Add in same line button. The selected category appears in the right text field on P2, tab no. 8 Sub-Matrix Selection.

You can select as many Sub-Matrix categories per row as you wish: click on a category in the left window on P2, tab no. 8 Sub-Matrix Selection and move it to the right window with the > Add in same line button. You can also select as many rows of sub matrix selections as you wish: click on a category in the left window and move it to the right window with the > Add in new line button. To add a further category to a newly added line, first single-click that line in the right window so that it is highlighted in blue. To unselect a selected Sub-Matrix category, click on the row in the right window and hit the < Remove line button on P2, tab no. 8 Sub-Matrix Selection.

The row will disappear from the right window.

Tip! 


•  If you wish to analyze any relation between, e.g., agents and
organizations, you need to select agent/organization in one
row and organization/agent in another row. If you wish to
analyze any relations among all agents, select agent/agent.

•  If you wish to analyze any relations among all agents and their
relation with all organizations, select agent/agent/organization
and, in another row, organization/agent/agent.
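
The tip above, together with the option of writing your own selection file (8.1.4), can be sketched as follows. This is only an illustration: the helper names `relation_rows` and `write_selection_file`, the file name, and the layout of one selection row per line in the .txt file are assumptions, not documented AutoMap behavior.

```python
from pathlib import Path


def relation_rows(first, second):
    """Rows needed to capture relations between two categories in both directions."""
    if first == second:
        return [f"{first}/{second}"]
    return [f"{first}/{second}", f"{second}/{first}"]


def write_selection_file(path, rows):
    """Write one selection row per line (assumed layout of the .txt file)."""
    Path(path).write_text("\n".join(rows) + "\n", encoding="utf-8")


# Relations between agents and organizations, plus relations among agents.
rows = relation_rows("agent", "organization") + relation_rows("agent", "agent")
write_selection_file("my_sub_matrix_selection.txt", rows)
```

The resulting file could then be loaded through File menu, Open Sub-Matrix Selection, Open from file, provided the assumed layout matches what AutoMap expects.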

8.3  Apply  Sub-Matrix  Selection 

1.  The Thesaurus content only option always applies automatically for the Sub-Matrix Selection. Select an adjacency option. Adjacency can be either direct (default) or rhetorical.

If you do not change the adjacency option, direct adjacency will be applied.

2.  To apply your Sub-Matrix Selection with the settings you have specified, click the Apply Sub-Matrix Selection button on the Sub-Matrix Selection index card.

AutoMap uses the entries in the Thesaurus to search the text(s) for meta-matrix tags. If a match is found, it will be translated into the related concept. The Thesaurus is case insensitive.

3.  See the pre-processed texts on P1, tab no. 6 Sub-Matrix Text.

8.3.1 Direct Adjacency

Direct adjacency means that the original distances between concepts that represent meta-matrix categories will neither be visualized nor considered for analysis.

To choose direct adjacency, click the Direct button in the Adjacency field on P2, tab no. 8 Sub-Matrix Selection index card. Then apply the Sub-Matrix Selection.

If the user does not change the adjacency option, AutoMap uses direct adjacency for analysis.

As a result, only concepts that represent meta-matrix categories are displayed on P1, tab no. 6 Sub-Matrix Text index card and will be considered for analysis. All concepts in the resulting text appear directly adjacent to each other.

8.3.2  Rhetorical  Adjacency 

To  choose  the  rhetorical  adjacency  click  the  Rhetorical  button  in  the  Adjacency  field  on 
P2,  tab  no.  8  Sub-Matrix  Selection  index  card.  Then  apply  the  Sub-Matrix  Selection.  If 
the  user  does  not  change  the  adjacency  option,  AutoMap  uses  direct  adjacency  for 
analysis. 

As a result, concepts that represent meta-matrix categories are displayed on P1, tab no. 6 Sub-Matrix Text and will be considered for analysis. Original distances of concepts that represent meta-matrix categories will be visually symbolized by placeholders (xxx) and considered for analysis.
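
The difference between the two adjacency options can be illustrated with a small sketch. It is a simplified model under stated assumptions, not AutoMap's implementation: the text is assumed to be already split into tokens, `selected` is a hypothetical set of concepts that represent the chosen meta-matrix categories, and punctuation handling is left out.

```python
def apply_selection(tokens, selected, adjacency="direct", placeholder="xxx"):
    """Keep selected concepts; drop the rest (direct) or replace them with a placeholder (rhetorical)."""
    if adjacency not in ("direct", "rhetorical"):
        raise ValueError("adjacency must be 'direct' or 'rhetorical'")
    result = []
    for token in tokens:
        if token in selected:
            result.append(token)
        elif adjacency == "rhetorical":
            # Preserve the original distance between kept concepts.
            result.append(placeholder)
    return result


tokens = ["the", "dog", "ate", "the", "lotus"]
selected = {"dog", "lotus"}
direct = apply_selection(tokens, selected, "direct")
rhetorical = apply_selection(tokens, selected, "rhetorical")
print(direct)
print(rhetorical)
```

With direct adjacency the two kept concepts end up next to each other; with rhetorical adjacency the placeholders keep them two positions apart, which matters once a window size is chosen for analysis.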

8.4  Un-Apply  a  Sub-Matrix  Selection 

To un-apply a Sub-Matrix Selection that was applied to the data, go to P2, tab no. 8 Sub-Matrix Selection index card and hit the Un-Apply button. The tab no. 6 Sub-Matrix Text on P1 will be cleared.

8.5  Save  Sub-Matrix  Selection 

Apply  the  Sub-Matrix  Selection  before  you  save  it. 

To save a Sub-Matrix Selection (the content of the right window on P2, tab no. 8 Sub-Matrix Selection), click the File menu and select Save Sub-Matrix Selection.

8.6 Save text(s) after Sub-Matrix Selection

To save the text(s) after the application of the Sub-Matrix Selection, click the File menu and select Save Text(s) after Sub-Matrix selection applied. All texts are automatically saved in a folder called "preprocessed" in the root directory of AutoMap. The filename will be "after_SubMatrixSelection_NameOfYourText.txt".

8.7  Example  for  Sub-Matrix  Selection 

The  Sub-Matrix  Selection  as  shown  here  is  based  on  the  example  for  the  Meta-Matrix 
Thesaurus. 

We  opened  P2,  tab  no.  3  -  Pre-Processing  Settings,  tab  no.  6-  Sub-Matrix  Selection. 


[Screenshot: P2, Pre-Processing Settings, Sub-Matrix Selection tab, showing the Select Sub-Matrix category list, the Adjacency options (Direct, Rhetorical), and the Apply and Un-Apply buttons.]


Then we applied the Sub-Matrix Selection with the following settings:

Setting 1
Select: Agent/Location/Action, Location/Agent/Action, Action/Location/Agent
Adjacency: Direct (default)
Resulting text:

. dog., dog. dog., dog.... dog,.. .,... prof_John_darren..
dog . ,,,.

Setting 2
Select: Agent/Location/Action, Location/Agent/Action, Action/Location/Agent
Adjacency: Rhetorical
Resulting text (see also next picture):

xxx. xxx xxx dog xxx xxx xxx xxx xxx xxx xxx. xxx. xxx
xxx xxx xxx xxx dog. xxx xxx xxx xxx xxx xxx xxx xxx
xxx xxx dog xxx xxx xxx xxx xxx xxx. xxx xxx xxx. xxx
xxx xxx xxx dog. xxx xxx. xxx. xxx xxx xxx xxx xxx xxx
xxx. xxx xxx dog xxx xxx xxx xxx xxx xxx xxx xxx xxx
xxx xxx xxx, xxx. xxx xxx xxx xxx xxx. xxx
xxx

xxx. xxx, xxx. xxx xxx xxx. xxx xxx xxx xxx xxx xxx xxx
xxx xxx. prof_John_darren xxx xxx. xxx xxx xxx xxx xxx.
xxx xxx xxx xxx xxx xxx dog. xxx. xxx xxx xxx xxx. xxx
xxx. xxx xxx. xxx xxx xxx, xxx, xxx, xxx xxx.

Setting 3
Select: Full Meta-matrix
Adjacency: Direct (default)
Resulting text:

. dog eating lotus., dog. yelling dog eating flowers
weeds., dog... roses, dog roses,.. met.
prof_John_darren.. dog . lotus, carnations, daffodils,
roses.

Setting 4
Select: Full Meta-matrix
Adjacency: Rhetorical
Resulting text:

xxx. xxx xxx dog eating xxx lotus xxx xxx xxx xxx. xxx.
xxx xxx xxx xxx xxx dog. xxx yelling xxx xxx xxx xxx xxx
xxx xxx xxx dog xxx eating xxx flowers xxx weeds, xxx
xxx xxx. xxx xxx xxx xxx dog. xxx xxx. xxx. xxx xxx roses
xxx xxx xxx xxx. xxx xxx dog xxx xxx xxx xxx roses xxx
xxx xxx xxx xxx xxx xxx, xxx. xxx xxx xxx xxx xxx. xxx
xxx

xxx. xxx, xxx. xxx xxx xxx. xxx met xxx xxx xxx xxx xxx
xxx xxx. prof_John_darren xxx xxx. xxx xxx xxx xxx xxx.
xxx xxx xxx xxx xxx xxx dog. xxx. xxx xxx xxx xxx. xxx
xxx. xxx xxx. xxx xxx lotus, carnations, daffodils, xxx
roses.


Detailed  example  for  Sub-Matrix  Selection: 


[Screenshot: P2, Pre-Processing Settings, Sub-Matrix Selection tab. Available categories: knowledge, agent, resource, task, event, organization, location, role, action, attribute. Selected rows: agent/location/action, location/agent/action, action/location/agent. Buttons: > Add new line, > Add in same line, < Remove line, Apply, Un-Apply. Adjacency options: Direct, Rhetorical.]

2007  AutoMap  Users  Guide  -  Page  64 


Statement Formation Choices


Statement  formation  choices  can  be  made  after  pre-processing  data  and  before  running 
analysis.  These  choices  define  if,  how,  and  where  concepts  will  be  linked.  Statement 
formation  settings  will  be  applied  to  the  highest  level  of  data  pre-processing  that  was 
applied.  If  no  pre-processing  was  performed,  statement  formation  settings  apply  to  the 
original  input  text.  If  the  user  does  not  modify  the  statement  formation  settings  AutoMap 
uses  a  set  of  default  settings. 

1.  Analysis  Settings 

To specify the Analysis Settings, use the Analysis Settings index card.

Your settings will be automatically applied in the analysis. You do not need to confirm them.

If you do not change any of the suggested options, a set of standard settings will be applied. The screen shot below shows an example of the AutoMap Analysis Settings tab in P2.


[Screenshot: P2, tab no. 4 Analysis Settings, sub-tab 1. Analysis Settings. Coding Ties: Directionality (Uni-directional, Bi-directional) and Strength. Windowing: Punctuation (Ignore punctuation completely; Reset window at the end of paragraphs only; Reset window at the end of paragraphs and sentences), Window-Size number chooser, and the Find size of largest text in text set button.]


Overview of the possible settings:

Coding Ties
Specify the way statements are counted.

•  Directionality: Select one of the following two possibilities by checking the button.
   -  Uni-Directional (When coding a tie, only 1st -> 2nd concept should be noted)
   -  Bi-Directional (When coding a tie, both 1st <-> 2nd concept shall be noted)

•  Strength: Strength will be printed out by default. To not print out Strength, uncheck the item.
   -  Item checked: Frequency (The cumulative frequency of every existing statement.)
   -  Item not checked: Existence of frequency will be printed out (binary result).

Windowing
Windowing is a method that codes the text as a map by placing relationships between pairs of concepts that occur within a window.

•  Punctuation: Select one option by checking the radio button.
   -  Ignore punctuation completely (Statements will be placed between all concepts.)
   -  Reset window at end of paragraphs only (Statements will be placed only within every single paragraph.)
   -  Reset window at end of paragraphs and sentences (Statements will be placed only within every single sentence.)

•  Window Size: Select one window size by using the number chooser.
   -  Window size between 2 and 100.
   -  The Window Size defines how distant concepts can be and still have a relationship. Only concepts in the same window can form statements.
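
How these settings interact can be sketched in a few lines of Python. This is a simplified model under stated assumptions, not AutoMap's actual algorithm: the function name `code_statements` is hypothetical, the text is assumed to be pre-split into units (the whole text, paragraphs, or sentences, depending on the punctuation option), and a window of size n is assumed to link a concept with the n - 1 concepts that follow it.

```python
from collections import Counter


def code_statements(units, window_size=2, bidirectional=False, strength=True):
    """Link concepts that co-occur within a window; the window is reset between units."""
    if not 2 <= window_size <= 100:
        raise ValueError("window size must be between 2 and 100")
    counts = Counter()
    for concepts in units:
        for i, first in enumerate(concepts):
            # Concepts that share a window with `first` and come after it.
            for second in concepts[i + 1 : i + window_size]:
                counts[(first, second)] += 1
                if bidirectional:
                    counts[(second, first)] += 1
    if not strength:
        # Binary result: a statement either exists or it does not.
        return {pair: 1 for pair in counts}
    return dict(counts)


# Two sentences; resetting at sentence ends means each is its own unit.
sentences = [["dog", "ate", "lotus"], ["dog", "ate", "weeds"]]
uni = code_statements(sentences, window_size=2)
wide = code_statements(sentences, window_size=3)
both = code_statements(sentences, window_size=2, bidirectional=True)
binary = code_statements(sentences, window_size=2, strength=False)
print(uni)
```

Passing the whole text as a single unit models the Ignore punctuation completely option; in that case statements can also span sentence boundaries.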


1.1 Default Settings

If you do not change any of the suggested options, the analysis will be done with the following Standard Settings:


Anaphora  Resolution 

AutoMap can perform anaphora resolution.

An anaphor is an expression that refers to another expression in the text. In a general linguistic sense, an anaphoric expression is a kind of deictic expression, that is, a word or phrase whose meaning relies entirely on context. Sometimes this context needs to be identified.

The anaphora resolution function is a pre-processing setting and can be found in Panel P2, Tab no. 3 Pre-Processing Settings in AutoMap's main interface. The screen shot below shows where the tool is located.


This  tool  extracts  network  information  from  text  that  requires  additional  textual 
references  from  within  a  certain  phrase  or  grammatical  construct. 

To  Access  Anaphora  Resolution:  Panel  P2  >  Tab  no.  3  Pre-Processing  Settings 
>  1.  Utilities  >  scroll  down  to  bottom  of  window 

Some examples of anaphora that would call for resolution:

•  The boy took the hamburger and ate it. Here "it" is anaphoric under the strict definition (it refers to the hamburger).

•  Larry called off work because he felt sick. Here "he" is anaphoric (it refers to Larry).

Moreover, think of a sentence taken out of context and the missing information required to fully explain its meaning.

Example: The President of our company came to visit us yesterday. The visit was the first time he had come to Pittsburgh since 1998.

If the second sentence stands alone, the anaphor must be resolved.

AutoMap can apply this pre-processing setting to identify anaphora instances and then isolate and remove them; this is activated by selecting the first and middle buttons under the Anaphora Resolution tool. The third button allows you to output the anaphora contained in your text. Each of these pre-processing settings can be undone.

Semantic  Text  Analysis 

AutoMap contains text analysis utilities to help you with the pre-processing and data analysis of your text examples. Take time to become familiar with them, as they allow you to work quickly with your text examples.

1.  Browse through texts

This function enables you to quickly jump from one text example to another. All panels are synchronized with the Browse Menu. The series of screen shots below the instructions shows how to access AutoMap's text browsing features.

How to use the Browse Menu:

To go backward or forward text by text:

Click the < button or the > button in the browse menu bar.


[Screenshot: browse menu bar with the |<, <, 1/2, >, and >| buttons and the Go to field with its OK button, above the P1 tabs: 1. Original Texts, 2. Texts after Symbol Removal, 3. Texts after Stemming, 4. Texts after Deletion, 5. Texts after Generalization, 6. Texts after Meta-Matrix Thesaurus, 7. Texts after Sub-Matrix Selection.]

To go to first or last text in the text set:

Click the |< button or the >| button.



To go to a specific text:

Enter the text number in the text field right next to the Go to command and hit OK.



The  name  of  the  currently  selected  text  is  displayed  on  the  Browse  menu. 

These  files  can  be  browsed: 

•  All input texts (P1, No. 1 Input Text index card).

•  All texts after each stage of pre-processing (all index cards of P1).

•  Concept lists per text (original input text or texts after each stage of pre-processing).

•  Map (P3) and Statistics (P4) outputs that relate to the text currently displayed on the index cards of P1.

1.1 Example for Browse Menu

The example shows a part of the "Our Text 1.txt" in panel P1 tab no. 1 Original Texts.

The browse menu tells you several important facts:


•  The  data  set  opened  contains  two  texts. 

•  The  first  of  the  loaded  texts  is  displayed. 

•  The  filename  of  the  displayed  text. 

2.  Concept  List 

The Concept List is displayed in panel P2 tab No. 1 Concept List. The Concept List is created automatically once a text or a set of texts is loaded or modified in AutoMap.

The concept list tells you several important facts about your text:

•  The number of concepts found in the text displayed in P1 tab No. 1 Original Text.

•  Related frequencies of those concepts.

•  Cumulated unique concepts and total concepts contained in the data set.

Tip!  The number of unique concepts considers each concept only once, whereas the number of total concepts considers repetitions of concepts.

By default, the Concept List is sorted by decreasing frequency of concepts. To sort the list alphabetically, click on the first-column header Concept. To re-sort the list by frequency, click on the second-column header Frequency.
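
The relationship between unique concepts, total concepts, and the two sort orders can be shown with a short sketch. It rests on assumptions that are not taken from AutoMap: the helper name `concept_list` is hypothetical, concepts are approximated as lowercased words, and the tokenization rule is a simple regular expression.

```python
import re
from collections import Counter


def concept_list(text):
    """Count how often each concept (approximated as a lowercased word) occurs in a text."""
    concepts = re.findall(r"[A-Za-z0-9_']+", text.lower())
    return Counter(concepts)


text = 'Mrs. Brown was unhappy with the dog. She yelled at it saying "You impossible dog!"'
freq = concept_list(text)

unique_concepts = len(freq)          # each concept counted once
total_concepts = sum(freq.values())  # repetitions included

# Default view: decreasing frequency (ties broken alphabetically here).
by_frequency = sorted(freq.items(), key=lambda item: (-item[1], item[0]))
# Alternative view: alphabetical by concept.
by_concept = sorted(freq.items())

print(unique_concepts, total_concepts)
print(by_frequency[:3])
```

In this excerpt "dog" occurs twice, so the total count is one higher than the unique count.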

2.1 Example for Concept List

The  example  below  shows  a  part  of  the  Concept  List  for  the  text  displayed  in  the  browse 
menu.  The  Table  is  ordered  by  Frequency.  The  concept  list  contains  more  entries  than  the 
interface  can  display: 



[Screenshot: Concept List. Callouts: click the Concept column heading to sort concepts alphabetically; click the Frequency column heading to sort concepts by frequency.]


3.  Create  and  refresh  Union  Concept  List 

The  Union  Concept  List,  found  in  panel  P2,  differs  from  the  Concept  List  (tab  no.  1)  in 
that  it  considers  concepts  across  all  texts  loaded  in  Automap,  rather  than  one  single  text 
file.  There  are  several  key  pieces  of  information  the  Union  Concept  List  tells  you: 

•  Concepts  contained  in  all  loaded  text  sets. 

•  Related,  cumulative  frequencies  of  concepts  in  all  text  sets. 

•  Cumulative  unique  and  total  concepts. 

Union  Concept  List  results  are  displayed  on  tab  No.  2  Union  Concept  List  in  P2. 
However,  you  must  first  refresh  the  union  concept  list  from  the  file  menu,  before  viewing 
your  results  on  the  No.  2  Union  Concept  tab.  The  union  concept  list  can  be  refreshed  after 
each  step  of  pre-processing  in  order  to  visualize  the  impact  of  pre-processing  operations 
on  the  union  of  concepts. 
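
Conceptually, the Union Concept List sums the per-text concept frequencies over all loaded texts. The sketch below illustrates this with Python's `collections.Counter`; the function name `union_concept_list` and the sample counts are hypothetical and only serve the illustration.

```python
from collections import Counter


def union_concept_list(per_text_counts):
    """Sum per-text concept frequencies into one list across all texts."""
    union = Counter()
    for counts in per_text_counts:
        union.update(counts)  # adds frequencies concept by concept
    return union


# Hypothetical per-text concept lists for two loaded texts.
text_1 = Counter({"dog": 3, "lotus": 1})
text_2 = Counter({"dog": 2, "roses": 1})

union = union_concept_list([text_1, text_2])
unique_concepts = len(union)
total_concepts = sum(union.values())
print(union.most_common())
print(unique_concepts, total_concepts)
```

Recomputing the union after a pre-processing step, for example after deleting concepts, shows how that step changes the unique and total counts.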

To  refresh  the  Union  Concept,  from  the  drop-down  menu  bar: 

File  menu  >  Refresh  Union  Concept  List. 

The ellipse in the screen shot below shows where to access the Refresh Union Concept List from the drop-down menu bar:

[Screenshot: File menu with Create and Refresh Union Concept List highlighted.]


The  call  out  box  in  the  screen  shot  below,  shows  where  to  locate  the  Union  Concept  tab, 
which  will  contain  the  results  of  the  Refresh  Union  Concept  List  analysis: 

[Screenshot callout: Union Concept List tab.]


Your  results  will  be  displayed  in  tab  No  2.  Union  Concept  List.  The  black  ellipse  in  the 
screen  shot  below  highlights  where  to  find  this  tab  in  the  Automap  GUI. 


[Screenshot: AutoMap GUI with tab No. 2 Union Concept List highlighted in P2. The tab reports Number of Unique Concepts: 100 and Number of Total Concepts: 229.]

By default, the list is sorted by decreasing frequency of concepts. To sort the list alphabetically, click on the first-column header Concept. To re-sort the list by frequency, click on the second-column header Frequency.

Note:  The number of unique concepts considers each concept only once, whereas the number of total concepts also considers repetitions of concepts.

3.1 Save Union Concept List

To  save  a  Union  Concept  List  follow  these  steps: 

File  menu  >  Save  Union  Concept  List. 

A  file  chooser  will  pop  up.  The  black  ellipse  below  highlights  how  to  save  a  Union 
Concept  List  from  the  drop-down  menu  bar. 


[Screenshot: File menu with Save Union Concept List highlighted. Menu entries: Open single file; Open multiple files (Select folder); Open Delete List; Open Generalization Thesaurus; Open Meta-Matrix Thesaurus; Open Sub-Matrix Selection; Create and Refresh Union Concept List; Output Storage Manager; Save Concept List per Text; Save Union Concept List; Save applied Delete List; Save applied Generalization Thesaurus; Save applied Meta-Matrix Thesaurus; Save Sub-Matrix Selection; Save Text(s) after Stemmer applied; Save Text(s) after Delete List applied; Save Text(s) after Generalization Thesaurus applied; Save Text(s) after Meta-Matrix Thesaurus applied; Save Text(s) after Sub-Matrix selection applied; Clear; Exit.]

3.2 Example for Union Concept List

Let us walk through an example of creating a Union Concept List, working with our text examples from above. They are restated below for your convenience:

1)  Mr.  Cray's  brown  dog  ate  the  lotus  blossom  at  10  am.  Mrs.  Brown  was  unhappy  with 
the  dog.  She  yelled  at  it  saying  "You  impossible  dog!"  But  the  dog  kept  eating  the 
flowers  and  weeds.  She  asked  Mr.  Cray  to  stop  the  dog.  He  couldn't.  Mrs.  Brown  planted 
roses  and  weeded  the  garden.  The  silly  dog  %  dug  up  the  roses  looking  for  a  vole  on  June 
12,  1880.  Weeding  was  no  longer  needed. 

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  the  next  day  to  concoct  a  plan.  John  Darren 
and  Mrs.  Brown  put  up  a  scarecrow.  She  thought  it  would  scare  the  dog.  Mr.  Craye  putup 
a  fence.  Problem  solved.  Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils,  and  roses. 

2)  Mr.  Cray's  brown  dog  stopped  eating  the  lotus  blossom  at  12  pm  the  next  day.  Mrs. 
Brown  was  now  happy  with  the  dog.  She  said  "You  good  dog!"  The  dog  no  longer  ate  the 
flowers  and  weeds.  Mr.  Cray  was  pleased  too. 

Mrs.  Brown  watered  the  roses  and  fertilized  the  garden  on  June  13,  1880. 

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  over  dinner  and  discussed  how  the  plan  had 
worked.  John  Darren  and  Mrs.  Brown  would  take  down  the  scarecrow  the  following 
week.  She  thought  it  was  too  scarry  for  the  dog.  Mr.  Craye  painted  his  fence.  Then  Mrs. 
Brown  watered  lotus,  carnations,  daffodils,  and  roses. 

The  first  step  is  to  load  in  your  multiple  texts  from  which  we  will  create  and  save  a  Union 
Concept  List.  An  empty  AutoMap  GUI  is  shown  below  before  any  text  is  loaded  into  it. 
This  is  what  you  will  see  when  you  first  run  AutoMap: 

From the drop-down menu:

File > Open Multiple Files > (select the location of the folder on the applicable drive containing your text examples)

Select "Open"


The  screen  shot  below  displays  the  Automap  GUI  after  our  text  examples  have  been 
loaded  into  it. 


To  run  the  Create  and  Refresh  Union  Concept  Utility  based  on  multiple  texts: 

File  >  Create  and  Refresh  Union  Concept  List 

The following series of screen shots presents, step by step, how to create and refresh a union concept list:

[Screenshot: File menu with Create and Refresh Union Concept List highlighted.]
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Select  "Save  Union  Concept  List”  from  the  drop-down  menu 


[Screen shot: the File menu with Save Union Concept List selected. The menu lists the following commands:]

Open single file
Open multiple files (Select folder)
Open Delete List
Open Generalization Thesaurus
Open Meta-Matrix Thesaurus
Open Sub-Matrix Selection
Create and Refresh Union Concept List
Output Storage Manager
Save Concept List per Text
Save Union Concept List
Save applied Delete List
Save applied Generalization Thesaurus
Save applied Meta-Matrix Thesaurus
Save Sub-Matrix Selection
Save Text(s) after Stemmer applied
Save Text(s) after Delete List applied
Save Text(s) after Generalization Thesaurus applied
Save Text(s) after Meta-Matrix Thesaurus applied
Save Text(s) after Sub-Matrix selection applied
Clear
Exit

The  result  is  shown  in  the  screen  shot  below: 
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The  table  is  ordered  by  Frequency.  The  concept  list  contains  more  entries  than  the 
interface can display. The displayed Union Concept List indicates there are 100 unique
concepts and 229 total concepts in the data set.

[Screen shot: the AutoMap GUI after the Union Concept List has been created. P1 shows the first text under the 1. Original Texts tab. P2 shows the 2. Union Concept List tab, with Number of Unique Concepts: 100 and Number of Total Concepts: 229; none of them are in the Delete List or the Generalization Thesaurus.]


4.  Semantic  Network  Analysis 

Semantic Network Analysis can be run on the original input text(s) or on texts that have been
pre-processed with a Delete List and/or a Generalization Thesaurus.

Before you run map analysis, make sure that you have completed the following steps:

• Pre-processed the texts if you wish to.

• Specified the analysis settings. If you do not do so, the default settings will be
applied.

• Specified additional output options if you wish to. If you do not select
additional output options, the default outputs will be generated.

If  you  wish  to  analyze  a  single  text,  click  the  Run  Analysis  menu  and  select  Single  Map 
Analysis. 

If  you  wish  to  analyze  a  set  of  texts,  click  the  Run  Analysis  menu  and  select  Multiple 
Map  Analysis. 

The  Results  will  be  displayed  on  P3  on  the  Map  index  card  and  on  P4  on  the  Stat  index 
card.  If  you  had  requested  additional  outputs,  those  will  be  generated  and  stored  under  the 
directories  specified  under  the  Section  Additional  Outputs. 
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Other  Semantic  Network  Analysis  Points  to  Consider 

• If you have analyzed multiple texts, you can browse through the results and
see the related texts.

• Results of multiple analyses are automatically saved in a folder named "output"
under the directory where AutoMap 2.0 is installed.

• This output folder contains a map file (nameOfText.map) and a stat file
(nameOfText.stat) for each text analyzed, as well as a stat_output.xls file that
contains the stats of all texts.

• The "Stat Output" folder is overwritten with every new analysis you run. If
you want to keep the results in the current "Stat Output" folder, rename the
folder.


4.1 Example for Semantic Network Analysis

An  extract  from  the  Denmark  text  was  used  as  input: 


Reporters  said  hundreds  of  people  emerged  from  shops  in  Copenhagen  city  centre  to 
see  what  was  happening,  and  used  their  mobile  phones  to  contact  their  families. 


The  text  was  pre-processed  with  AutoMap's  customized  extensive  Delete  List.  These  are 
the  resulting  texts: 


Input text:

Mr. Cray's brown dog ate the lotus blossom at 10 am. Mrs. Brown was unhappy with the
dog. She yelled at it saying "You impossible dog!" But the dog kept eating the flowers and
weeds. She asked Mr. Cray to stop the dog. He couldn't. Mrs. Brown planted roses and
weeded the garden. The silly dog % dug up the roses looking for a vole on June 12, 1880.
Weeding was no longer needed.

Prof. Darren, Mrs. Brown & Mr. Cray met the next day to concoct a plan. John Darren
and Mrs. Brown put up a scarecrow. She thought it would scare the dog. Mr. Craye put up
a fence. Problem solved. Then Mrs. Brown planted lotus, carnations, daffodils, and roses.

Tool used:

AutoMap's customized extensive Delete List: a, and, as, at, but, for, he, her, her, hers,
him, his, i, it, its, me, mine, my, nor, of, or, our, she, so, that, the, their, theirs, them, they,
to, us, we, who, whoever, whom, whomever, will, would, you, your, yours, yourself

Setting: Direct Adjacency (default)

Resulting text:

Mr. Cray's brown dog ate lotus blossom 10 am. Mrs. Brown was unhappy with dog. yelled
saying "You impossible dog!" dog kept eating flowers weeds. asked Mr. Cray stop dog.
couldn't. Mrs. Brown planted roses weeded garden. silly dog % dug up roses looking vole
on June 12, 1880. Weeding was no longer needed.

Prof. Darren, Mrs. Brown & Mr. Cray met next day concoct plan. John Darren Mrs. Brown
put up scarecrow. thought scare dog. Mr. Craye put up fence. Problem solved. Then Mrs.
Brown planted lotus, carnations, daffodils, roses.

Setting: Rhetorical Adjacency

Resulting text:

Mr. Cray's brown dog ate xxx lotus blossom xxx 10 am. Mrs. Brown was unhappy with xxx
dog. xxx yelled xxx xxx saying "You impossible dog!" xxx xxx dog kept eating xxx flowers xxx
weeds. xxx asked Mr. Cray xxx stop xxx dog. xxx couldn't. Mrs. Brown planted roses xxx
weeded xxx garden. xxx silly dog % dug up xxx roses looking xxx xxx vole on June 12, 1880.
Weeding was no longer needed.

Prof. Darren, Mrs. Brown & Mr. Cray met xxx next day xxx concoct xxx plan. John Darren
xxx Mrs. Brown put up xxx scarecrow. xxx thought xxx xxx scare xxx dog. Mr. Craye put up
xxx fence. Problem solved. Then Mrs. Brown planted lotus, carnations, daffodils, xxx roses.

Next we run Map Analysis on both texts using AutoMap's default Analysis Settings.

These are the Map and Statistics outputs for the first text (direct adjacency):

Map:

5 mrs
2 mr
2 darren
2 brown
1 you
1 worked
1 with
1 week
1 weeds
1 watered
1 was
1 too
1 thought
1 then
1 take
1 stopped
1 scary
1 scarecrow
1 said
1 roses
1 prof
1 pm
1 pleased
1 plan
1 painted
1 over
1 on
1 now
1 no
1 next
1 mr
1 met
1 lotus
1 longer
1 June
1 John
1 how
1 happy
1 had
1 good
1 garden

Stat:

# of concepts analyzed: unique: 62, total: 88
# of concepts in statements: unique: 54, total: 60
# of isolated concepts: unique: 12, total: 13
# of statements: unique: 54, total: 60
Density (based on Statements): unique: 0.87, total: 0.97

These are the Map and Statistics outputs for the second text (rhetorical adjacency):

Map:

2 mr
2 brown
1 you
1 worked
1 weeds
1 watered
1 was
1 too
1 then
1 take
1 stopped
1 said
1 prof
1 pleased
1 plan
1 over
1 on
1 now
1 no
1 next
1 mr
1 met
1 lotus
1 longer
1 June
1 John
1 happy
1 had
1 good
1 garden
1 following
1 fence
1 dog
1 discussed
1 day
1 darren
1 craye
1 Cray's
1 Cray
1 carnations
1 brown
1 1880
1 13
1 12

Stat:

# of concepts analyzed: unique: 60, total: 84
# of concepts in statements: unique: 55, total: 61
# of isolated concepts: unique: 6, total: 7
# of statements: unique: 55, total: 61
Density (based on Statements): unique: 0.92, total: 1.02
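The guide does not say how the Density figure is derived. The values in both Stat outputs are consistent with dividing the number of statements by the number of unique concepts analyzed, so the short check below tests that reading. The function name is illustrative, and the formula is an inference from these four numbers only.

```python
def statement_density(num_statements, num_unique_concepts):
    """Statements per unique concept analyzed, rounded as in the Stat output."""
    return round(num_statements / num_unique_concepts, 2)


# First Stat output: 62 unique concepts analyzed, 54 unique and 60 total statements.
print(statement_density(54, 62), statement_density(60, 62))
# Second Stat output: 60 unique concepts analyzed, 55 unique and 61 total statements.
print(statement_density(55, 60), statement_density(61, 60))
```

Under this reading, a total density above 1 (as in the second output) simply means there are more statement occurrences than unique concepts.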


Note:  For  more  information  about  the  impact  of  coding  choices  on 
map  analysis  results  you  might  have  a  look  at  our  publications 
(http://www.casos.cs.cmu.edu/projects/automap/publications.html). 


Content  Analysis 

From  content  analysis  to  semantic  networks 
I. Open AutoMap

On  Empire: 

•  Navigate  to  C:\Documents  and  Settings\Carley\Desktop\Tools. 

• Double-click the Run AutoMap icon. The AutoMap Graphical User Interface
(GUI) pops up.

From  anywhere  else:  (requires  a  web  connection) 
http://www.casos.cs.cmu.edu/projects/automap/software.html 
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II.  Overview  of  the  AutoMap  Graphical  User  Interface  (GUI) 

Below  is  a  screen  capture  of  the  AutoMap  GUI: 


[Screen capture of the AutoMap GUI, with callouts for the Menus, the Browse Menu, the Index Cards, and a Panel.]

The AutoMap GUI is divided into four primary quadrants, or panels: P1 (top left), P2
(bottom left), P3 (top right), and P4 (bottom right).

A drop-down menu bar at the top of the window provides access to analysis tools and
utilities. The Browse Menu allows you to quickly navigate between texts you have loaded
into AutoMap.

The P2 panel can be edited, but the other panels (P1, P3, P4) cannot. Information
displayed on each panel is always related to the information displayed in the other three
panels. The Browse Menu relates to all four panels at the same time.

III.  Load  multiple  text  documents  into  AutoMap 

In  order  to  analyze  multiple  texts  simultaneously,  you  will  need  to  store  the  texts  together 
in  one  folder. 

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\TextInputFiles
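For readers who want to inspect the same folder of texts outside the GUI, the idea of "one folder, many texts" can be mirrored with a small Python helper. This is only a sketch: the function name and the assumptions that the texts are UTF-8 files ending in .txt are not taken from AutoMap.

```python
from pathlib import Path


def load_texts(folder):
    """Return {file name: text} for every .txt file directly inside folder."""
    texts = {}
    for path in sorted(Path(folder).glob("*.txt")):
        # Assumption: the input files are plain text encoded as UTF-8.
        texts[path.name] = path.read_text(encoding="utf-8")
    return texts
```

Calling load_texts on the folder that holds the input files returns the texts keyed by file name, in alphabetical order, which is convenient for stepping through them one by one.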

• In the AutoMap menu bar, select File, then Open Multiple Files. A folder
chooser window will pop up.

• Navigate to the folder containing the texts you wish to analyze, and
single-click on it.

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\TextInputFiles

• Click the Open button in the bottom right corner.

The first text will be displayed in panel P1 under the tab titled 1. Original Texts. You can
use the Browse Menu to browse through the texts.

Below  is  a  screen  capture  of  the  first  text  display: 


The Browse Menu
allows you to quickly
navigate from one text
sample to the next


IV.  Carry  out  a  simple  content  analysis 

A  simple  content  analysis  determines  the  frequencies  of  all  words  in  a  text  document. 

Examine  the  Concept  List: 

The Concept List considers concepts in each text file individually. It is created
automatically when a text or set of texts is loaded into AutoMap, and is displayed in P2
under the tab titled 1. Concept List.
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The  Concept  List  tells  you  several  important  facts  about  each  text  in  your  set: 

• The number of concepts found in the text displayed in P1 under the tab titled
1. Original Texts.

•  Related  frequencies  of  those  concepts 

•  Cumulated  unique  concepts  and  total  concepts  contained  in  the  data  set 

Note:  The  number  of  unique  concepts  considers  each  concept  only  once. 
The  number  of  total  concepts  considers  repetitions  of  concepts. 
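The distinction between unique and total concepts, and the two sort orders of the Concept List, can be illustrated with Python's Counter. The tokenization rule here (lowercased runs of letters, digits, and apostrophes) is an assumption made for the sketch and may differ from how AutoMap identifies concepts.

```python
import re
from collections import Counter


def concept_list(text):
    """Count how often each concept occurs in a single text."""
    # Assumption: a concept is a lowercased run of letters, digits, or apostrophes.
    return Counter(re.findall(r"[A-Za-z0-9']+", text.lower()))


counts = concept_list("The dog ate the lotus. The dog was happy.")
unique_concepts = len(counts)            # each concept counted once
total_concepts = sum(counts.values())    # repetitions included
by_frequency = counts.most_common()      # default order: decreasing frequency
alphabetical = sorted(counts.items())    # order after clicking the Concept header
print(unique_concepts, total_concepts, by_frequency[0])
```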

By default, the Concept List is sorted by decreasing frequency of concepts. To sort the
list alphabetically, click on the first-column header Concept. To re-sort the list by
frequency, click on the second-column header Frequency.

Below is a screen capture showing part of the Concept List, sorted by frequency, for the
first document in our loaded folder of texts (Text1-in-user's guide.txt):


Click  this  column  heading 
to  sort  concepts 
alphabetically. 


Click  this  column  heading  to  sort 
concepts  by  frequency. 


Create  and  refresh  the  Union  Concept  List: 

The  Union  Concept  List  considers  concepts  across  all  texts  in  a  set  (unlike  the  Concept 
List,  which  considers  each  text  in  the  set  individually).  It  is  located  in  P2  under  the  tab 
titled  2.Union  Concept  List. 

The  Union  Concept  List  tells  you  several  important  facts  about  your  text  set: 

•  Concepts  contained  in  all  loaded  text  sets 
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•  Related,  cumulative  frequencies  of  concepts  in  all  text  sets 

•  Cumulative  unique  and  total  concepts. 

Before  viewing  your  Union  Concept  List,  you  must  refresh  it.  In  the  Menu  Bar,  choose 
File,  then  Refresh  Union  Concept  List. 

The  Union  Concept  List  can  be  refreshed  after  each  step  of  pre-processing  to  visualize 
the  impact  of  pre-processing  operations  on  the  union  of  concepts. 
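Conceptually, the Union Concept List is the sum of the per-text concept counts. The sketch below shows that idea with a Counter; the function name and the tokenization rule are assumptions for illustration, not AutoMap's own procedure.

```python
import re
from collections import Counter


def union_concept_list(texts):
    """Sum concept frequencies over all texts (a mapping of name -> text)."""
    union = Counter()
    for body in texts.values():
        # Assumption: concepts are lowercased runs of letters, digits, or apostrophes.
        union.update(re.findall(r"[A-Za-z0-9']+", body.lower()))
    return union


texts = {"a.txt": "The dog ate roses.", "b.txt": "The dog slept."}
union = union_concept_list(texts)
print(len(union), sum(union.values()))   # unique concepts, total concepts
print(union.most_common(2))              # cumulative frequencies, highest first
```

Re-running the function on texts after a pre-processing step (for example after a delete list has been applied) corresponds to refreshing the list, and makes the effect of that step on the concept counts visible.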

Below  is  a  screen  capture  showing  where  to  find  the  Refresh  Union  Concept  List 
command: 


[Screen capture: the File menu, showing the Create and Refresh Union Concept List command.]

After  refreshing  your  Union  Concept  List,  you  can  view  it  under  the  2. Union  Concept 
List  tab  in  P2.  Below  is  a  screen  capture  showing  where  this  tab  is  located  in  the  GUI: 


[Screen capture: the AutoMap GUI with the first text displayed in P1 and the 2. Union Concept List tab selected in P2. The most frequent concepts shown include the, brown, dog, a, and, and roses.]
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By  default,  the  list  is  sorted  by  decreasing  frequency  of  concepts.  To  sort  the  list 
alphabetically, click on the first-column header Concept. To re-sort the list by frequency,
click on the second-column header Frequency.

Note:  The  number  of  unique  concepts  considers  each  concept  only  once. 
The  number  of  total  concepts  also  considers  repetitions  of  concepts. 

V. Save your Union Concept List as a csv file

To  specify  a  location  for  your  Union  Concept  List  file,  choose  File  in  the  Menu  Bar,  then 
Output  Storage  Manager. 

Below  is  a  screen  capture  showing  where  to  find  this  in  the  GUI: 


[Screen capture: the File menu, showing the Output Storage Manager command below Create and Refresh Union Concept List.]


A  window  will  pop  up  listing  all  pre-processing  output  storage.  This  is  the  Output 
Storage  Manager. 

The Union Concept List appears as the second item in the Output Storage Manager. Click
the Save file as... button to the right of the default file pathway. In the file chooser
window that pops up, browse to the pathway you want.

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\UnionConceptList

Below is a screen capture showing where to find the correct Save file as... button in the
Output Storage Manager:

Click the Save Settings button at the bottom right corner of the Output Storage Manager.
Close the Output Storage Manager window by clicking on the red X at the top right
corner.

In the AutoMap Menu Bar, choose File, then Save Union Concept List.

VI.  View  your  csv  file  in  Microsoft  Excel 

Navigate  to  the  location  of  your  new  csv  file  and  double-click  on  its  icon. 

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\UnionConceptList

A box titled Open With pops up. Click on Microsoft Excel, then click OK.

Note:  You  must  have  MS  Excel  installed  on  your  computer  in  order  to 
view  your  csv  file  this  way. 
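If Excel is not available, a csv file can also be read with a few lines of Python. The sketch below makes no assumption about the column layout of the saved Union Concept List, which this section does not describe; it simply returns each row as a list of strings. The function name is illustrative.

```python
import csv


def read_csv_rows(path):
    """Return every row of a CSV file as a list of strings."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.reader(handle))
```

Printing the first few rows returned by read_csv_rows is usually enough to see which columns the file contains before doing anything else with it.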

VII.  Save  and  view  your  results  as  a  binary  csv  file 

Follow Steps 3 and 4.

VIII. Carry out a simple Semantic Network Analysis

A  simple  network  analysis  involves  no  thesauri  and  uses  all  words. 
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A  Semantic  Network  Analysis  can  be  run  on  original  input  texts  or  on  texts  that  have 
been  pre-processed  with  a  Delete  List  and/or  a  Generalization  Thesaurus. 

Before  running  the  analysis: 

Make  sure  you  have  completed  the  following  steps: 

•  Pre-process  the  texts  if  you  wish  to 

•  Specify  the  analysis  settings  (otherwise  the  default  settings  will  be  applied) 

•  Specify  additional  output  options  if  you  wish  to  (if  not,  the  default  outputs  will 
be  generated) 

About  the  analysis  settings: 

Use  the  Analysis  Settings  index  card  to  specify  the  analysis  settings.  Your  settings  will 
automatically  be  applied  to  the  analysis  (you  do  not  need  to  confirm  them). 

Below  is  a  screen  capture  showing  where  to  find  the  Analysis  Settings  index  card  in  P2: 


[Screen capture: the P2 index cards 1. Concept List, 2. Union Concept List, 3. Pre-Processing Settings, and 4. Analysis Settings, with the sub-tabs 1. Analysis Settings and 2. Output Options.]


The  following  chart  lists  the  possible  analysis  settings: 


Coding Ties
Specify the way statements are counted.

   Directionality
   Select one of the following two possibilities by checking the button.

      • Uni-Directional (When coding a tie, only 1st -> 2nd concept should be noted)
      • Bi-Directional (When coding a tie, both 1st <-> 2nd concept shall be noted)

   Strength
   Strength will be printed out by default. To not print out Strength, uncheck the item.

      • Frequency (The cumulative frequency of every existing statement.)
      • Item not checked: Existence of frequency will be printed out (binary result).

Windowing
Windowing is a method that codes the text as a map by placing relationships
between pairs of Concepts that occur within a window.

   Punctuation
   Select one option by checking the radio button.

      • Ignore punctuation completely (Statements will be placed between all concepts.)
      • Reset window at end of paragraphs only (Statements will be placed only within
        every single paragraph.)
      • Reset window at end of paragraphs and sentences (Statements will be placed only
        within every single sentence.)

   Window Size
   Select one window size by using the number chooser.

      • Window size between 2 and 100. The Window Size defines how distant concepts
        can be and still have a relationship. Only concepts in the same window can form
        statements.


If  you  do  not  want  to  change  any  of  the  suggested  options,  the  analysis  will  be  done  with 
a  set  of  standard  (default)  settings. 
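The interaction of window size, directionality, and window resets can be made concrete with a short sketch. It assumes a sliding window in which each concept is linked to the concepts that follow it within the same window; AutoMap's exact counting rules are not given here, so the function names and this interpretation are illustrative only.

```python
from collections import Counter


def window_statements(concepts, window_size=2, bidirectional=False):
    """Link each concept to the ones that follow it inside the same window."""
    statements = Counter()
    for i, first in enumerate(concepts):
        # A window of size n covers this concept plus the next n - 1 concepts.
        for second in concepts[i + 1 : i + window_size]:
            statements[(first, second)] += 1
            if bidirectional:
                statements[(second, first)] += 1
    return statements


def statements_per_unit(units, **options):
    """Reset the window at the end of each unit (sentence or paragraph)."""
    total = Counter()
    for unit in units:
        total.update(window_statements(unit, **options))
    return total


sentence = ["mrs", "brown", "planted", "roses"]
print(window_statements(sentence, window_size=2))
print(window_statements(sentence, window_size=3))
```

With a window of 2 only neighboring concepts are linked; widening the window to 3 also links concepts that are one position apart. Leaving Strength unchecked corresponds to keeping only the keys of the returned Counter (existence) rather than its counts (frequency).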

Below  is  a  screen  capture  showing  the  standard  settings: 


About  the  output  options: 
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The  map  and  the  statistic  output  generated  by  AutoMap  are  displayed  in  P3  and  P4, 
respectively.  In  addition,  AutoMap  offers  further  output  options  that  can  be  chosen  in  the 
10. Output  Options  index  card  in  P2.  All  additional  outputs  are  only  generated  after 
analyses  are  run. 

Below  is  a  screen  capture  showing  the  10. Output  Options  index  card  in  P2: 


[Screen capture: the Output Options index card in P2, shown alongside the 1. Concept List, 2. Union Concept List, 3. Pre-Processing Settings, and Analysis Settings index cards.]


For  all  types  of  multiple  analysis,  term  distribution  lists  and  matrices  can  be  chosen  as 
output  options  in  the  upper  two  fields  of  the  10. Output  Options  index  card  in  P2. 

Points  to  consider: 

•  No  list  or  matrix  is  generated  by  default. 

•  Lists  or  matrices  are  only  generated  if  the  user  checks  the  item  he  or  she 
wants  and  runs  an  analysis  (of  any  type). 

• If pre-processing was performed, the list relates to the stage of
pre-processing that was used for the analysis. If several pre-processing
techniques were applied, analysis will always be run on the highest stage of
pre-processing.

•  The  requested  lists  and  matrices  are  automatically  saved  in  a  folder  called 
Term  Distribution  Lists  and  Matrices  under  the  root  directory  of  AutoMap.  This 
folder  is  overwritten  with  every  new  analysis  you  run.  If  you  want  to  save  the 
results  of  a  current  Term  Distribution  Lists  and  Matrices  folder,  simply  rename 
that  folder. 

•  Two  output  lists  are  generated  for  each  Term  Distribution  List  checked. 

•  One  output  matrix  is  generated  for  each  Term  Distribution  Matrix  checked. 

The following chart lists the types and content of Term Distribution Lists and Matrices:

Term Distribution Lists

Output type: Concepts analyzed

   • List of concept analyzed.CSV
     Content: Concept, Text, Frequency
   • Statistics of concepts analyzed.CSV
     Content: Concept, Cumulated sum across text set, Number of texts concept occurs in,
     Percentage of texts concept occurs in, Texts

Output type: Concepts in statements and isolates

   • List of concept in statements.CSV
     Content: Concept, Text, Frequency
   • Statistics of concept in statements.CSV
     Content: Concept, Cumulated sum across text set, Number of texts concept occurs in,
     Percentage of texts concept occurs in, Texts
   • List of isolates in statements.CSV
     Content: Concept, Text, Frequency
   • Statistics of isolates.CSV
     Content: Concept, Cumulated sum across text set, Number of texts concept occurs in,
     Percentage of texts concept occurs in, Texts

Output type: Statements

   • List of statements.CSV
     Content: Statement, Text, Frequency
   • Statistics of statements.CSV
     Content: Statement, Cumulated sum across text set, Number of texts statement occurs in,
     Percentage of texts statement occurs in, Texts

Term Distribution Matrices

Concepts in statements by concepts in statements:

Output type: Concepts in statements

   • Matrix of concepts in statements.CSV
     Content: Matrix of Concepts that were linked into statements (first row) by Concepts
     that were linked into statements (first column). If count was chosen, cells contain
     cumulated frequency of concept. If binary was chosen, cells denote existence (1) or
     absence (0) of concept.

Term(s) by text(s):

Output type: Concepts analyzed

   • Matrix of concept analyzed.CSV
     Content: Matrix of Concept (union of concepts listed in first row) by texts (all text
     names listed in first column). If count was chosen, cells contain cumulated frequency
     of concept. If binary was chosen, cells denote existence (1) or absence (0) of concept.

Output type: Concepts in statements and isolates

   • Matrix of concept in statements.CSV
     Content: Matrix of Concept (union of concepts listed in first row) by texts (all text
     names listed in first column). If count was chosen, cells contain cumulated frequency
     of concept. If binary was chosen, cells denote existence (1) or absence (0) of concept.

Output type: Statements

   • Matrix of statements.CSV
     Content: Matrix of Statements (union of statements listed in first row) by texts (all
     text names listed in first column). If count was chosen, cells contain cumulated
     frequency of statement. If binary was chosen, cells denote existence (1) or absence (0)
     of statement.
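A term-by-text matrix of the kind described in the chart, with either counts or binary existence in the cells, can be sketched as follows. The function name, the tokenization rule, and the returned header/rows layout are assumptions for illustration and are not the layout of AutoMap's CSV files.

```python
import re
from collections import Counter


def term_by_text_matrix(texts, binary=False):
    """Build a matrix with one row per text and one column per concept."""
    counts = {
        name: Counter(re.findall(r"[A-Za-z0-9']+", body.lower()))
        for name, body in texts.items()
    }
    # The columns are the union of concepts over all texts.
    concepts = sorted(set().union(*counts.values()))
    header = ["text"] + concepts
    rows = []
    for name, counter in counts.items():
        cells = [counter[concept] for concept in concepts]
        if binary:
            # Existence (1) or absence (0) instead of frequency.
            cells = [1 if value else 0 for value in cells]
        rows.append([name] + cells)
    return header, rows


header, rows = term_by_text_matrix({"t1": "dog ate roses dog", "t2": "dog slept"})
print(header)
for row in rows:
    print(row)
```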


Save  your  upcoming  analysis  in  DyNetML  format: 

Before running your Multiple Map Analysis, you must specify that your results will be
saved in DyNetML format so that they can eventually be read into ORA.

On  the  10.  Output  Options  index  card  in  P2,  scroll  down  to  Additional  Output  Formats 
and  check  the  per  Map  box  next  to  DyNetML  for  Map  Analysis. 

Open  the  Output  Storage  Manager.  (See  section  V.). 

You  will  see  Text  Analysis  Output  Directory  listed  under  the  Analysis  Output  Storage 
category.  Click  the  Save  in  folder. . .  button  on  the  far  right  in  that  row.  In  the  folder 
chooser  window  that  pops  up,  navigate  to  the  folder  you  want. 


2007  AutoMap  Users  Guide  -  Page  99 


Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\TextOutputFiles

Click  Open. 

Save  these  settings  and  close  the  Output  Storage  Manager.  (See  section  V.) 

Run  the  Multiple  Map  Analysis: 

In  the  Menu  Bar,  choose  Run  Analysis,  then  Multiple  Map  Analysis. 

The  results  will  be  displayed  on  the  Semantic  Network  of  Current  Text  index  card  in  P3 
and  on  the  2. Statistics  index  card  in  P4. 

Below  is  a  screen  capture  showing  the  results  of  Multiple  Map  Analysis  in  P3  and  P4: 


If  you  have  analyzed  multiple  texts,  you  can  browse  through  the  results  and  see  the 
related  texts.  Results  of  multiple  analyses  are  automatically  saved  in  a  folder  titled  output 
under the directory where AutoMap is installed.

The  output  folders  are  overwritten  with  every  new  analysis  you  run.  If  you  want  to  save 
the  results  of  a  current  analysis  folder,  simply  rename  that  folder. 
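Because the output folders are overwritten on every run, it can help to rename them automatically before starting a new analysis. The helper below is a sketch under stated assumptions: the function name and the timestamp suffix are illustrative, and the folder path passed in must be whichever output folder your installation actually uses.

```python
from datetime import datetime
from pathlib import Path


def archive_output_folder(folder):
    """Rename folder to '<name>_<timestamp>' and return the new path."""
    source = Path(folder)
    stamp = datetime.now().strftime("%Y%m%d_%H%M%S")
    target = source.with_name(f"{source.name}_{stamp}")
    # The rename keeps the folder's contents untouched.
    source.rename(target)
    return target
```

Running archive_output_folder on the output folder after each analysis leaves a dated copy of every result set and lets the next run create a fresh folder.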
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IX.  Open  ORA 

On  Empire: 


•  Navigate  to  C:\Documents  and  Settings\Carley\Desktop\Tools. 

•  Double-click  the  Run  ORA  icon.  The  ORA  interface  pops  up. 

From  anywhere  else:  (requires  a  web  connection) 
http://www.casos.cs.cmu.edu/projects/ora/software.html 

X.  Load  your  files  into  ORA 

Click  the  Load  button  on  the  far  right.  A  file  chooser  window  will  pop  up. 

Choose Select Files. Navigate to your DyNetML files (generated in section VIII).

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\TextOutputFiles

Select  the  files  you  want  by  holding  down  the  key  while  clicking  on  each  one.  Click 
Open,  then  Finish. 

Example: Select Text1-in-user's guide.txt and Text2-in-user's guide.txt

Note:  If  some  file  is  already  loaded  into  ORA,  choose  the  Append  As 
Additional  Meta-Matrix  option. 

XI.  Visualize  your  meta-matrices 

The  left-hand  portion  of  the  ORA  screen  is  the  ORA  navigator.  In  this  navigator,  select 
the  matrix  you  want  to  visualize  by  clicking  on  it.  Click  the  Visualize  this  meta-matrix 
button  near  the  center  of  the  screen  to  generate  the  visualization. 

Below are the visualized meta-matrices for Text1-in-user's guide.txt (this page) and
Text2-in-user's guide.txt (next page):

[Visualization of Text1-in-user's guide.txt: a network of concept nodes, including solved, blossom, darren, problem, garden, concoct, dog, prof, saying, longer, unhappy, with, you, and needed.]
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These  meta-matrices  also  can  be  found  in: 


C:\Documents and Settings\carley\Desktop\AutomapLesson1\MetaMatrices


XII.  Run  a  Semantic  Network  Report 


In  ORA's  Menu  Bar,  choose  Analysis,  then  Generate  Reports.  A  window  will  pop  up. 
You  will  see  Select  Report  at  the  top  of  the  window.  Click  the  v-shaped  icon  to  the  right, 
and  select  Semantic  Network  from  the  resulting  drop-down  menu. 


Below  is  a  screen  capture  showing  where  to  find  the  Semantic  Network  Report  in  the 
Generate  Reports  pop-up  window: 


Check the two boxes for Text1-in-user's guide.txt and Text2-in-user's guide.txt inside the
field titled Select one or more meta matrices.


To  specify  a  filename  for  the  results,  click  the  Browse  button  near  the  bottom  right  corner 
of  the  Generate  Reports  pop-up  window.  Navigate  to  the  desired  location  and  type  in  a 
filename  or  use  the  default  name. 


Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\SemanticNetworkReport.html

Click the Next button, then the Finish button at the bottom of the Generate Reports
pop-up window.

ORA  will  run.  An  HTML  file  will  pop  up  displaying  the  results. 

Return  to  the  ORA  window.  A  small  box  titled  Complete  has  popped  up.  Click  the  OK 
button. 

Below is the Semantic Network Report generated for Text1-in-user's guide.txt and
Text2-in-user's guide.txt:


SEMANTIC-NETWORK REPORT

Input data: Text1-in-user's guide.txt, Text2-in-user's guide.txt
Start time: Tue Mar 06 10:59:33 2007

This is a comparison of two semantic networks. Each node in the network is considered a
Concept, and each edge a Statement connecting two concepts. Statement weights are
interpreted as the number of times the statement occurred in the underlying input text.

Network                      Concepts  Statements  Density
Text1-in-user's guide.txt    74        200         0.0370233
Text2-in-user's guide.txt    69        184         0.0392157
Union                        98        310         0.032611
Intersection                 45        8           0.0040404

Symmetric difference

The symmetric distance of network A to network B is a new network that contains the
entities in A that are not in B.

Network                                                  Concepts  Statements
Text1-in-user's guide.txt to Text2-in-user's guide.txt   29        192
Text2-in-user's guide.txt to Text1-in-user's guide.txt   24        176

Saved output networks

The summary statistics across all maps saved to: C:\Documents and
Settings\jmcgille_summary_statistics.csv

Produced by ORA developed at CASOS - Carnegie Mellon University
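
The Density column of the report can be checked by hand. The sketch below assumes that density is the number of statements divided by the number of possible directed links between distinct concepts, n(n - 1). That formula is inferred from the figures in the report above, not taken from ORA's documentation, and the function name is hypothetical.

```python
def directed_density(concepts: int, statements: int) -> float:
    """Share of possible directed links (no self-links) that are present.

    Assumption: density = statements / (n * (n - 1)); this reproduces the
    report's figures but is not a documented ORA formula.
    """
    if concepts < 2:
        return 0.0
    return statements / (concepts * (concepts - 1))


# (concepts, statements) pairs copied from the report table above.
report = {
    "Text1": (74, 200),
    "Text2": (69, 184),
    "Union": (98, 310),
    "Intersection": (45, 8),
}

for name, (concepts, statements) in report.items():
    print(f"{name:<13}{directed_density(concepts, statements):.7f}")

# The concept counts of the two symmetric differences follow from the
# intersection: concepts of one network minus the shared concepts.
text1_only = 74 - 45
text2_only = 69 - 45
print(text1_only, text2_only)
```

The same subtraction works for the statement counts of the symmetric differences (200 - 8 and 184 - 8).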
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Ontological  Text  Coding 

1.  Meta  Matrix  Text  Analysis 


Meta Matrix Text Analysis can be run on a text or a set of texts that were pre-processed
with a Meta-Matrix Thesaurus. It classifies and analyzes the concepts in texts
according to the Meta-Matrix model ontology, that is, according to the categories of the
meta-matrix and the resulting inter- and intra-related sub-matrices (Diesner & Carley,
2005). Investigating the connections within and between the matrices contained in the
meta-matrix (cells in Table 1) can provide insight into the complex structure of the
social systems represented in texts.


The  Meta-Matrix  Model  used  in  AutoMap 
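Column headings: Agent, Knowledge, Resource, Task/Event, Organization, Location, Action,
Role, Attribute.
Row headings: Agent, Knowledge, Resource, Task/Event, Organization, Location, Action.

The named cells of the matrix, listed as row by column:

Row           Column        Network
Agent         Agent         Social network
Agent         Knowledge     Knowledge network
Agent         Resource      Capability network
Agent         Task/Event    Assignment network
Agent         Organization  Membership network
Agent         Location      Agent location network
Knowledge     Knowledge     Information network
Knowledge     Resource      Training network
Knowledge     Task/Event    Knowledge requirement network
Knowledge     Organization  Organizational knowledge network
Knowledge     Location      Knowledge location network
Resource      Resource      Resource network
Resource      Task/Event    Resource requirement network
Resource      Organization  Organizational capability network
Resource      Location      Resource location network
Task/Event    Task/Event    Precedence network
Task/Event    Organization  Organizational assignment network
Task/Event    Location      Task/Event location network
Organization  Organization  Interorganizational network
Organization  Location      Organizational location network
Location      Location      Proximity network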
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Before  you  run  Meta-Matrix  analysis  make  sure  that  you  have  completed  the  following 
actions: 


•  Pre-processed the text(s) with a Meta-Matrix Thesaurus. If you also want to
use a Delete List and/or a Generalization Thesaurus, make sure to use these
tools before you use the Meta-Matrix Thesaurus.

•  Specified the analysis settings. If you do not do so, the default settings
will be applied.

•  Specified additional output options if you wish to. If you do not select
additional output options, the default outputs will be generated.

Analyzing  Text(s) 

If  you  wish  to  analyze  a  single  text,  click  the  Run  Analysis  menu  and  select  Single  Meta 
Matrix  Text  Analysis. 

If  you  wish  to  analyze  a  set  of  texts,  click  the  Run  Analysis  menu  and  select  Multiple 
Meta  Matrix  Text  Analysis. 

The  Results  will  be  displayed  on  P3  on  the  Map  tab  and  on  P4  on  the  Stat  tab 
respectively.  If  you  had  requested  additional  outputs,  those  will  be  generated  and  stored 
under  the  directories  specified  under  the  Section  Additional  Outputs. 

If  you  have  analyzed  multiple  texts,  you  can  browse  through  the  results  and  see  the 
related  texts. 

Results of multiple analyses are automatically saved in a folder named “output” under the
directory where AutoMap 2.0 is installed. This output folder contains a map file
(nameOfText.map) and a stat file (nameOfText.stat) for each text analyzed, as well as a
stat_output.xls file that contains the stats of all texts.

The “Stat Output” folder is overwritten with every new analysis you run. To keep the
results of the current “Stat Output” folder, rename the folder before you run a new analysis.
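
Renaming the folder before each run can be scripted. The following is a minimal sketch, assuming the results folder sits directly under the installation directory. The function name, the timestamp suffix and the example path are illustrative choices, not part of AutoMap.

```python
from datetime import datetime
from pathlib import Path
from typing import Optional


def archive_output_folder(install_dir: Path,
                          folder_name: str = "Stat Output") -> Optional[Path]:
    """Rename the results folder so the next analysis cannot overwrite it.

    Returns the new path, or None if there is no folder to archive.
    The folder name and its location are assumptions taken from the text.
    """
    source = install_dir / folder_name
    if not source.is_dir():
        return None
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S")
    target = install_dir / f"{folder_name} {stamp}"
    source.rename(target)  # same directory, so this is a plain rename
    return target


# Hypothetical usage; replace the path with your own installation directory:
# archive_output_folder(Path(r"C:\Program Files\AutoMap"))
```

Running the function before every new analysis leaves one timestamped folder per run.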

1.1 Example for Meta Matrix Text Analysis

An extract from Our Text I.txt was used as input:

Mr.  Cray's  brown  dog  ate  the  lotus  blossom  at  10  am.  Mrs.  Brown  was  unhappy  with 
the  dog.  She  yelled  at  it  saying  "You  impossible  dog!"  But  the  dog  kept  eating  the 
flowers  and  weeds.  She  asked  Mr.  Cray  to  stop  the  dog.  He  couldn't.  Mrs.  Brown 
planted  roses  and  weeded  the  garden.  The  silly  dog  %  dug  up  the  roses  looking  for  a 
vole  on  June  12,  1880.  Weeding  was  no  longer  needed. 

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  the  next  day  to  concoct  a  plan.  John  Darren 
and  Mrs.  Brown  put  up  a  scarecrow.  She  thought  it  would  scare  the  dog.  Mr.  Craye 
put  up  a  fence.  Problem  solved.  Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils, 
and  roses. 
The text was pre-processed with the Meta-Matrix Thesaurus. These are the resulting texts:


Setting: Not Selected - Thesaurus content only (default)

Resulting text:

mr . < agent> < agent> dog ate lotus < event> < attribute> am . mrs . < agent>
was unhappy with dog . yelled saying < agent resource> impossible dog!" dog kept
eating flowers weeds . asked mr . cray stop dog . couldn't . mrs . < agent>
planted roses weeded garden . silly dog < attribute> dug up roses looking vole
on June < attribute> , < attribute> . weeding was no longer needed . prof .
< agent> , mrs . < agent> < attribute> mr . cray met next day < task> plan .
John < agent> mrs . < agent> put up scarecrow . thought scare dog . mr .
< agent> put up fence . problem solved . then mrs . < agent> planted lotus ,
< resource> , < resource> , roses .


Setting: Thesaurus content only, Direct Adjacency (default if Thesaurus content only is
chosen)

Resulting text:

. < agent> < agent> < event> < attribute> . . < agent> .
< agent resource> . < agent> . < attribute> < attribute> < attribute> . . .
< agent> . < agent> < attribute> . < task> . < agent> . < agent> . . .
< agent> . . . < agent> < resource> < resource> .


Setting: Thesaurus content only, Rhetorical Adjacency

Resulting text:

xxx . < agent> < agent> xxx xxx xxx < event> < attribute> xxx . xxx . < agent>
xxx xxx xxx xxx . xxx xxx < agent resource> xxx xxx xxx xxx xxx xxx xxx . xxx
xxx . xxx xxx xxx . xxx . xxx . < agent> xxx xxx xxx xxx . xxx xxx < attribute>
xxx xxx xxx xxx xxx xxx xxx < attribute> < attribute> . xxx xxx xxx xxx xxx .
xxx xxx xxx . < agent> xxx . < agent> < attribute> xxx . xxx xxx xxx xxx
< task> xxx . xxx < agent> xxx . < agent> xxx xxx xxx . xxx xxx xxx . xxx .
< agent> xxx xxx . xxx xxx . xxx xxx . < agent> xxx xxx < resource>
< resource> xxx .
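
The difference between the two adjacency options can be illustrated in a few lines. This is a sketch of the behavior visible in the resulting texts above, not AutoMap's implementation. It assumes the text is already split into words and periods, that only periods are kept as punctuation, and that the thesaurus is a plain dictionary mapping a word to its meta-matrix category. The function name and the sample thesaurus are hypothetical.

```python
SENTENCE_END = "."


def thesaurus_content_only(tokens, thesaurus, adjacency="direct"):
    """Keep thesaurus categories; drop or mask every other word.

    direct:     words without a thesaurus entry are removed, so the
                remaining categories become direct neighbours.
    rhetorical: such words are replaced by the placeholder 'xxx', so the
                original distance between categories is preserved.
    """
    if adjacency not in ("direct", "rhetorical"):
        raise ValueError("adjacency must be 'direct' or 'rhetorical'")
    result = []
    for token in tokens:
        if token == SENTENCE_END:
            result.append(token)  # sentence marks are kept (assumption)
        elif token.lower() in thesaurus:
            result.append(f"<{thesaurus[token.lower()]}>")
        elif adjacency == "rhetorical":
            result.append("xxx")
        # direct adjacency: the unmatched word is simply dropped
    return result


# Illustrative input; the thesaurus entries are made up for this sketch.
tokens = ["mrs", ".", "brown", "planted", "lotus", "carnations", "."]
thesaurus = {"brown": "agent", "carnations": "resource"}

print(" ".join(thesaurus_content_only(tokens, thesaurus, "direct")))
print(" ".join(thesaurus_content_only(tokens, thesaurus, "rhetorical")))
```

With direct adjacency the two categories end up next to each other. With rhetorical adjacency the placeholders keep them as far apart as they were in the original text, which matters once a window size is applied.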


Then we ran Map Analysis on the resulting texts using AutoMap's default Analysis Settings.
These are the results:


Map

Not Selected -             Thesaurus content only,    Thesaurus content only,
Thesaurus content only     Direct Adjacency           Rhetorical Adjacency
(default)

1 10 am                    1 10 am                    1 blossom 10
1 12 1880                  1 12 1880                  1 brown was
1 1880 weeding             1 1880 weeding             1 carnations daffodils
1 am mrs                   1 am mrs                   1 concoct plan
1 asked mr                 1 asked mr                 1 couldn't mrs
1 ate lotus                1 ate lotus                1 Cray stop
1 blossom 10               1 blossom 10               1 Cray's brown
1 brown was                1 brown was                1 craye put up
1 carnations daffodils     1 carnations daffodils     1 daffodils roses
1 concoct plan             1 concoct plan             1 day concoct
1 couldn't mrs             1 couldn't mrs             1 dog yelled
1 Cray stop                1 Cray stop                1 dug up
1 Cray's brown             1 Cray's brown             1 eating flowers
1 craye put up             1 craye put up             1 fence problem
1 daffodils roses          1 daffodils roses          1 flowers weeds
1 day concoct              1 day concoct              1 garden silly
1 dog yelled               1 dog yelled               1 impossible dog
1 dug up                   1 dug up                   1 John darren
1 eating flowers           1 eating flowers           1 June 12
1 fence problem            1 fence problem            1 kept eating
1 flowers weeds            1 flowers weeds            1 longer needed
1 garden silly             1 garden silly             1 looking vole
1 impossible dog           1 impossible dog           1 lotus carnations
1 John darren              1 John darren              1 met next
1 June 12                  1 June 12                  1 mr craye
1 kept eating              1 kept eating              1 needed prof
1 longer needed            1 longer needed            1 next day
1 looking vole             1 looking vole             1 no longer
1 lotus carnations         1 lotus carnations         1 on June
1 met next                 1 met next                 1 plan John
1 mr craye                 1 mr craye                 1 planted roses
1 needed prof              1 needed prof              1 problem solved
1 next day                 1 next day                 1 prof darren
1 no longer                1 no longer                1 put up
1 on June                  1 on June                  1 put up fence
1 plan John                1 plan John                1 roses weeded
1 planted roses            1 planted roses            1 saying you
1 problem solved           1 problem solved           1 scare dog
1 prof darren              1 prof darren              1 scarecrow thought
1 put up                   1 put up                   1 silly dog
1 put up fence             1 put up fence             1 solved then
1 roses weeded             1 roses weeded             1 stop dog
1 saying you               1 saying you               1 then mrs
1 scare dog                1 scare dog                1 thought scare
1 scarecrow thought        1 scarecrow thought        1 unhappy with
1 silly dog                1 silly dog                1 up scarecrow
1 solved then              1 solved then              1 vole on
1 stop dog                 1 stop dog                 1 was unhappy
1 then mrs                 1 then mrs                 1 weeded garden
1 thought scare            1 thought scare            1 weeding was
1 unhappy with             1 unhappy with             1 weeds asked
1 up scarecrow             1 up scarecrow             1 with dog
1 vole on                  1 vole on                  1 yelled saying
1 was unhappy              1 was unhappy              1 you impossible
1 weeded garden            1 weeded garden            2 brown planted
1 weeding was              1 weeding was              2 darren mrs
1 weeds asked              1 weeds asked              2 mr Cray
1 with dog                 1 with dog                 5 mrs brown
1 yelled saying            1 yelled saying
1 you impossible           1 you impossible
2 brown planted            2 brown planted
2 darren mrs               2 darren mrs
2 mr Cray                  2 mr Cray
5 mrs brown                5 mrs brown
Stat

                                Not Selected -    Thesaurus content   Thesaurus content
                                Thesaurus content only, Direct        only, Rhetorical
                                only (default)    Adjacency           Adjacency

File:

# of concepts analyzed:
  unique:                       62                62                  62
  total:                        88                88                  88

# of concepts in statements:
  unique:                       80                80                  80
  total:                        87                87                  87

# of isolated concepts:
  unique:                       0                 0                   0
  total:                        0                 0                   0

# of statements:
  unique:                       80                80                  80
  total:                        87                87                  87

Density (based on Statements):
  unique:                       1.29              1.29                1.29
  total:                        1.4               1.4                 1.4
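
The statements in the map are pairs of neighbouring concepts together with the number of times each pair occurs. The sketch below shows one way such pairs can be formed with a sliding window. It assumes a window size of 2 (each concept is linked to the concept that directly follows it), links in reading order only, and a window that runs across sentence boundaries. These assumptions match the pairs listed above, but the function is an illustration and not AutoMap's own routine.

```python
from collections import Counter


def form_statements(tokens, window_size=2):
    """Link each concept to the concepts that follow it inside the window.

    A window of size 2 covers a concept and its immediate successor.
    Returns a Counter that maps (first, second) pairs to frequencies.
    """
    if window_size < 2:
        raise ValueError("window_size must be at least 2")
    statements = Counter()
    for i, first in enumerate(tokens):
        for second in tokens[i + 1 : i + window_size]:
            statements[(first, second)] += 1
    return statements


# A short, already pre-processed token sequence used only for illustration.
tokens = "mrs brown planted roses then mrs brown planted lotus".split()
statements = form_statements(tokens)

# Print in the layout of the map: frequency first, then the statement.
for (first, second), frequency in sorted(statements.items()):
    print(frequency, first, second)

print("unique:", len(statements), "total:", sum(statements.values()))
```

A larger window links each concept to several following concepts, which raises the number of statements without changing the number of concepts.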


Note: For more information about the impact of coding choices on map analysis results,
please visit us on the web
(http://www.casos.cs.cmu.edu/projects/automap/publications.html).

2.  Sub  Matrix  Text  Analysis 

Sub Matrix Text Analysis distills one or several sub-networks from the meta-matrix
network and retranslates the meta-matrix entities into the text-level concepts that
represent these Meta-Matrix categories. This routine enables a more thorough analysis of
particular cells of the meta-matrix (Diesner & Carley, 2004c). Sub Matrix Text Analysis
can be run on a text or a set of texts that were pre-processed with a Meta-Matrix
Thesaurus and for which Sub-Matrices were selected.

Before  you  run  Sub-Matrix  analysis  make  sure  that  you  have  completed  the  following 
actions: 

•  Pre-processed the text(s) with a Meta-Matrix Thesaurus. If you also want to use a
Delete List and/or a Generalization Thesaurus, make sure to use these tools before
you use the Meta-Matrix Thesaurus.

•  Selected Sub-Matrix Categories.

•  Specified the analysis settings. If you do not do so, the default settings
will be applied.

•  Specified additional output options if you wish to. If you do not select additional
output options, the default outputs will be generated.
The user is given the option to exclude entities of any applied ontology from being
considered as nodes, so that they do not form statements of their own. Such entities are
instead considered as attributes of the other entities in the ontology that do act as
nodes and link into statements. An example would be the category “attribute”, which
represents information that is inherent to a certain entity. The concepts “teacher” or
“male” might be considered as attributes that relate to the entity agent.

To use ontologies in this way (so that they represent not only entities, but also
features of entities), follow these steps:

1.  Use the drag-and-drop field labeled Concepts not forming edges at the bottom of P2,
tab no. 10. Output Options. Click Refresh to automatically generate a list of all
entities in the applied ontology.

2.  Use the > and < buttons to move entities from one window to the other.

3.  Apply your decision by clicking the Apply Selection button.

4.  Before running Sub-Matrix Text Analysis, decide whether you want to perform
“Network Text Analysis” (NTA) or “Social Network Text Analysis” (SNTA). To
select a type, use the toggle button at the bottom of P2, 10. Output Options index
card. Both types are a form of Sub-Matrix Text Analysis, but they differ in what
they measure:

1.  NTA: If one measures a textual network, the entities of the applied
ontology that are treated as inherent information of nodes, rather than as
nodes, still fill a structural position and a semantic function in the
text. They contribute to the text's density and should therefore be taken
into consideration as statements for maps and stats. The resulting
DyNetML file will contain nodes and inherent information on nodes (if
an entity and an attribute co-occur within the window).

2.  SNTA: If one measures a social network that is represented in or extracted
from a text, links between actual nodes and entities of the applied
ontology that are treated as inherent information of nodes are not
represented in the maps and stats, since this would overfit the social
network. The resulting DyNetML file will NOT contain inherent
information on nodes.

Whether to use NTA or SNTA is a “text-philosophical question” whose
answer depends upon what the user wants to measure: a textual network
or a social network.

5.  Run  Sub-Matrix  Text  Analysis. 
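
The practical difference between the two modes in step 4 can be sketched as a filter over the statements. The code assumes statements are stored as pairs of concepts with a frequency, and that the entities moved into Concepts not forming edges are given as a set. The function name and the sample data are hypothetical, and writing the DyNetML file is not shown.

```python
def select_statements(statements, non_node_concepts, mode="NTA"):
    """Return the statements that enter the maps and stats.

    NTA:  every statement is kept, including links that involve a
          concept treated as inherent information of a node.
    SNTA: links that involve such a concept are left out, so only
          links between actual nodes remain.
    """
    if mode == "NTA":
        return dict(statements)
    if mode == "SNTA":
        return {
            pair: frequency
            for pair, frequency in statements.items()
            if not non_node_concepts.intersection(pair)
        }
    raise ValueError("mode must be 'NTA' or 'SNTA'")


# Illustrative data: "teacher" is treated as an attribute, not as a node.
statements = {
    ("john", "mary"): 2,
    ("john", "teacher"): 1,
    ("teacher", "school"): 1,
}
non_nodes = {"teacher"}

print(select_statements(statements, non_nodes, "NTA"))
print(select_statements(statements, non_nodes, "SNTA"))
```

Under NTA all three statements count toward density. Under SNTA only the link between the two agents remains.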

If  you  wish  to  analyze  a  single  text,  click  the  Run  Analysis  menu  and  select  Single  Meta- 
Matrix  analysis. 

If  you  wish  to  analyze  a  set  of  texts,  click  the  Run  Analysis  menu  and  select  Multiple 
Meta-Matrix  Analysis. 

The  Results  will  be  displayed  on  P3  on  the  Map  index  card  and  on  P4  on  the  Stat  index 
card. 
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If  you  had  requested  additional  outputs,  those  will  be  generated  and  stored  under  the 
directories  specified  under  the  Section  Additional  Outputs. 

If  you  have  analyzed  multiple  texts,  you  can  browse  through  the  results  and  see  the 
related  texts. 

Results of multiple analyses are automatically saved in a folder named “output” under the
directory where AutoMap 2.0 is installed. This output folder contains a map file
(nameOfText.map) and a stat file (nameOfText.stat) for each text analyzed, as well as a
stat_output.xls file that contains the stats of all texts.

The “Stat Output” folder is overwritten with every new analysis you run. To keep the
results of the current “Stat Output” folder, rename the folder before you run a new analysis.

2.1 Example for Sub Matrix Text Analysis

This  example  is  based  on  the  example  for  Sub  Matrix  Selection. 

Our  Text  I.txt  was  used  as  input: 

Mr.  Cray's  brown  dog  ate  the  lotus  blossom  at  10  am.  Mrs.  Brown  was  unhappy  with 
the  dog.  She  yelled  at  it  saying  "You  impossible  dog!"  But  the  dog  kept  eating  the 
flowers  and  weeds.  She  asked  Mr.  Cray  to  stop  the  dog.  He  couldn't.  Mrs.  Brown 
planted  roses  and  weeded  the  garden.  The  silly  dog  %  dug  up  the  roses  looking  for  a 
vole  on  June  12,  1880.  Weeding  was  no  longer  needed. 

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  the  next  day  to  concoct  a  plan.  John  Darren 
and  Mrs.  Brown  put  up  a  scarecrow.  She  thought  it  would  scare  the  dog.  Mr.  Craye 
put  up  a  fence.  Problem  solved.  Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils, 
and  roses. 

The text was pre-processed with the Meta-Matrix Thesaurus. Then we selected the full
meta-matrix. These are the resulting texts:

Setting: Select: Knowledge/Agent/Resource

Resulting text:

. dog lotus., dog. dog flowers., dog... roses, dog roses,..
., . dog . lotus, carnations, daffodils, roses.


Setting: Select: Knowledge/Agent/Resource

Resulting text:

xxx. xxx xxx dog xxx xxx lotus xxx xxx xxx xxx. xxx.
xxx xxx xxx xxx xxx dog. xxx xxx xxx xxx xxx xxx xxx
xxx xxx xxx dog xxx xxx xxx flowers xxx xxx. xxx xxx
xxx. xxx xxx xxx xxx dog. xxx xxx. xxx. xxx xxx roses
xxx xxx xxx xxx. xxx xxx dog xxx xxx xxx xxx roses xxx
xxx xxx xxx xxx xxx xxx, xxx. xxx xxx xxx xxx xxx. xxx
xxx xxx. xxx, xxx. xxx xxx xxx. xxx xxx xxx xxx xxx
xxx xxx xxx xxx. xxx xxx xxx xxx. xxx xxx xxx xxx xxx.
xxx xxx xxx xxx xxx xxx dog. xxx. xxx xxx xxx xxx. xxx
xxx. xxx xxx. xxx xxx lotus, carnations, daffodils, xxx
roses. xxx


Then we ran Map Analysis on both texts using AutoMap's default Analysis Settings.
These are the results:


Map (first text)

1 lotus carnations
1 met the
1 mr craye
1 neededprof darren
1 next day
1 no longer
1 on June
1 plan John
1 planted roses
1 problem solved
1 put up
1 put up a
1 roses looking
1 saying you
1 scare the
1 scarecrow she
1 she yelled
1 silly dog
1 solved then
1 stop the
1 the silly
1 then mrs
1 thought it
1 to stop
1 unhappy with
1 up the
1 vole on
1 was unhappy
1 weeded the
1 weeding was
1 weeds she
1 with the
1 would scare
1 yelled at
1 you impossible
2 brown planted
2 mr Cray
4 the dog
5 mrs brown

Stat (first text)

# of concepts analyzed:         unique: 72    total: 118
# of concepts in statements:    unique: 108   total: 117
# of isolated concepts:         unique: 0     total: 0
# of statements:                unique: 108   total: 117
Density (based on Statements):  unique: 1.5   total: 1.63

Map (second text)

1 12 pm
1 13 1880
1 1880 prof
1 and weeds
1 at 12
1 ate the
1 blossom at
1 brown would
1 carnations daffodils
1 Cray was
1 Cray's brown
1 craye painted
1 daffodils and
1 darren mrs
1 day mrs
1 dinner and
1 discussed how
1 dog the
1 down the
1 eating the
1 fence then
1 fertilized the
1 flowers and
1 following week
1 for the
1 garden on
1 good dog
1 had worked
1 happy with
1 his fence
1 how the
1 it was
1 John darren
1 June 13
1 longer ate
1 lotus carnations
1 met over
1 mr craye
1 next day
1 no longer
1 now happy
1 on June
1 over dinner
1 painted his
1 plan had
1 pleased too
1 pm the
1 prof darren
1 roses and
1 said you
1 scarecrow the
1 scary for
1 she thought
1 stopped eating
1 take down
1 the scarecrow
1 then mrs
1 thought it
1 too scary
1 was too
1 watered the
1 weeds mr
1 week she
1 with the
1 worked John
1 would take
1 you good
2 brown watered
2 mr Cray
3 the dog
5 mrs brown

Stat (second text)

# of concepts analyzed:         unique: 68    total: 107
# of concepts in statements:    unique: 98    total: 106
# of isolated concepts:         unique: 0     total: 0
# of statements:                unique: 98    total: 106
Density (based on Statements):  unique: 1.44  total: 1.56
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Outputs 


Outputs  for  Map  Analysis,  Meta  Matrix  Text  Analysis  and  Sub  Matrix  Text  Analysis  are 
displayed  on  P3  on  the  Semantic  Network  index  card  and  on  P4  on  the  Stat  index  card. 

If  you  have  analyzed  multiple  texts,  you  can  browse  through  the  results  and  see  the 
related  texts. 

Results of multiple analyses are automatically saved in a folder named “output” under
the directory where AutoMap 2.0 is installed. This output folder contains a map file
(nameOfText.map) and a stat file (nameOfText.stat) for each text analyzed, as well as a
stat_output.xls file that contains the stats of all texts. Additional Outputs will be
generated if requested by the user.

1.  Semantic  Network 

After  running  analysis,  the  semantic  network  will  be  displayed  on  P3  on  the  Semantic 
Network  Index  card. 

The  semantic  network  contains  one  coded  statement  per  line. 

If the Frequency item was checked on the Analysis Settings index card, the first column
of the semantic network indicates the frequency of every displayed statement.

Each  semantic  network  generated  is  automatically  saved  in  a  folder  named  “Stat  Output” 
under  the  directory  where  AutoMap  1.2  is  installed.  This  output  folder  contains  a 
semantic  network  file  (nameOfText.map)  and  a  stat  file  (nameOfText.stat)  for  each  text 
analyzed  as  well  as  a  stat_output.xls  file  that  contains  the  stats  of  all  texts. 

1.1 Example for Semantic Network Output

For examples of semantic network files, see the Semantic Network of current text index
cards in the examples for analysis.

Semantic  networks  are  output  as  .CSV  (.map  in  previous  versions). 

2.  Statistics 

After  running  the  analysis,  the  Stat  file  will  be  displayed  on  P4  on  the  Statistics  Index 
Card. 

Each stat file generated is automatically saved in a folder named “Stat Output” under the
directory where AutoMap 1.2 is installed. This output folder contains a map file
(nameOfText.map) and a stat file (nameOfText.stat) for each text analyzed, as well as a
stat_output.xls file that contains the stats of all texts.
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Entries  in  the  stat  output  and  explanation: 


Entry: File:
Explanation: Name of the analyzed text file.

Entry: # of concepts analyzed: (unique: / total:)
Explanation: The unique count includes each distinct concept once; the total count also
includes the repeated occurrences of concepts that appear more than once in a given
text. All concepts that occurred in the analyzed texts are considered.

Entry: # of concepts in statements: (unique: / total:)
Explanation: Only concepts that linked into statements are considered.

Entry: # of isolated concepts: (unique: / total:)
Explanation: Only concepts that did not link into statements are considered.

Entry: # of statements: (unique: / total:)
Explanation: The unique count includes each distinct statement once; the total count
also includes the repeated occurrences of statements that appear more than once in a
given text.

Entry: density (based on statements) (unique: / total:)
Explanation: Unique density is the density of the resulting network based on unique
statements; total density is the density of the resulting network based on the total
number of statements.

Entry: Analysis Settings

  Punctuation:
  The Punctuation option chosen by the user.

  Window Size:
  The Window Size chosen by the user.

  Directionality:
  The Directionality option chosen by the user.
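
The statement entries of a stat file can be recomputed from a map listing. The sketch below assumes the layout used in this guide's examples, one statement per line with the frequency in the first column. It also assumes that density is the number of statements divided by the number of unique concepts analyzed. That formula is inferred from the example figures (80 / 62 gives 1.29 and 87 / 62 gives 1.4) and is not a documented definition; all function names are hypothetical.

```python
def parse_map_lines(lines):
    """Turn 'frequency statement' lines into {statement: frequency}."""
    statements = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        frequency, statement = line.split(maxsplit=1)
        statements[statement] = int(frequency)
    return statements


def statement_counts(statements):
    """Return (unique, total): distinct statements and all occurrences."""
    return len(statements), sum(statements.values())


def density(statement_count, unique_concepts_analyzed):
    """Assumed formula: statements per unique concept, two decimals."""
    return round(statement_count / unique_concepts_analyzed, 2)


# Three lines in the layout of the example maps.
sample = parse_map_lines(["1 10 am", "2 brown planted", "5 mrs brown"])
unique, total = statement_counts(sample)
print("unique:", unique, "total:", total)

# Figures from the Meta Matrix Text Analysis example:
# 62 unique concepts analyzed, 80 unique and 87 total statements.
print(density(80, 62), density(87, 62))
```

Each distinct statement counts once toward the unique figure however often it occurs, while its frequency is added to the total.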

2.1 Example for Statistic Output

For examples of stat files, see the Stat of current text index cards in the examples
for analysis.


3.  Additional  Outputs 

The  map  and  the  statistic  output  generated  by  AutoMap  are  displayed  on  P3  and  P4, 
respectively. 

For all types of multiple analysis, many more outputs can be generated on demand.

Output  Options 

In addition to the map and the statistic output, AutoMap offers further output options
that can be chosen on P2, 10. Output Options index card. All additional outputs are
generated only after an analysis has been run.
[Screenshot: Pre-processing and Analysis Settings panel, tab 10. Output Options, with the
fields Term Distribution Lists, Term Distribution Matrices, Non-Identified Concepts and
Additional Output Formats]


1.  Term  Distribution  Lists  and  Matrices 

Term distribution lists and matrices can be chosen as output options for all types of
multiple analysis in the upper two fields of P2, tab no. 10. Output Options.

Points To Consider

•  No list or matrix will be generated by default.

•  A list or matrix is generated only if the user checks it and then runs an
analysis of any type.

•  If pre-processing was performed, the list relates to the stage of
pre-processing that was used for the analysis. If several pre-processing
techniques were applied, analysis will always be run on the highest stage of
pre-processing.

•  The requested lists and matrices will be automatically saved in a folder called
Term Distribution Lists and Matrices under the root directory of AutoMap.

•  The Term Distribution Lists and Matrices folder is overwritten with every new
analysis you run. To keep the results of the current Term Distribution Lists
and Matrices folder, rename the folder before you run a new analysis.

•  For each Term Distribution List checked, two output lists are generated.

•  For each Term Distribution Matrix checked, one output matrix is generated.
See the output examples for the content of each list.


1.1  Types and content of Term Distribution Lists and Matrices

Each entry below gives the name of the output file, followed by the content of the output.

Term Distribution Lists

Output type: Concepts analyzed

•  List of concept analyzed.CSV: Concept, Text, Frequency.

•  Statistics of concepts analyzed.CSV: Concept, Cumulated sum across text set, Number of texts concept occurs in, Percentage of texts concept occurs in, Texts.

Output type: Concepts in statements and isolates

•  List of concept in statements.CSV: Concept, Text, Frequency.

•  Statistics of concept in statements.CSV: Concept, Cumulated sum across text set, Number of texts concept occurs in, Percentage of texts concept occurs in, Texts.

•  List of isolates in statements.CSV: Concept, Text, Frequency.

•  Statistics of isolates.CSV: Concept, Cumulated sum across text set, Number of texts concept occurs in, Percentage of texts concept occurs in, Texts.

Output type: Statements

•  List of statements.CSV: Statement, Text, Frequency.

•  Statistics of statements.CSV: Statement, Cumulated sum across text set, Number of texts statement occurs in, Percentage of texts statement occurs in, Texts.

Term Distribution Matrices

Concepts in statements by concepts in statements:

Output type: Concepts in statements

•  Matrix of concepts in statements.csv: Matrix of concepts that were linked into statements (first row) by concepts that were linked into statements (first column). If count was chosen, cells contain the cumulated frequency of the concept. If binary was chosen, cells denote existence (1) or absence (0) of the concept.

Term(s) by text(s):

Output type: Concepts analyzed

•  Matrix of concept analyzed.CSV: Matrix of concepts (union of concepts listed in first row) by texts (all text names listed in first column). If count was chosen, cells contain the cumulated frequency of the concept. If binary was chosen, cells denote existence (1) or absence (0) of the concept.

Output type: Concepts in statements and isolates

•  Matrix of concept in statements.csv: Matrix of concepts (union of concepts listed in first row) by texts (all text names listed in first column). If count was chosen, cells contain the cumulated frequency of the concept. If binary was chosen, cells denote existence (1) or absence (0) of the concept.

Output type: Statements

•  Matrix of statements.CSV: Matrix of statements (union of statements listed in first row) by texts (all text names listed in first column). If count was chosen, cells contain the cumulated frequency of the statement. If binary was chosen, cells denote existence (1) or absence (0) of the statement.
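The term-by-text outputs described above can be illustrated with a minimal Python sketch. It is not AutoMap's implementation: the function name term_distribution and the input shape (a dict from text name to a list of concepts) are assumptions made for illustration. It builds the concept-by-text matrix in count or binary form and the per-concept statistics (cumulated sum, number of texts, percentage of texts).

```python
from collections import Counter


def term_distribution(texts, binary=False):
    """Build a concept-by-text matrix and per-concept statistics.

    texts: dict mapping a text name to its list of concepts
           (hypothetical input shape, chosen for this sketch).
    """
    counts = {name: Counter(concepts) for name, concepts in texts.items()}
    # Union of concepts across the text set (the "first row" of the matrix).
    union = sorted(set().union(*counts.values()))

    matrix = {}
    for name, counter in counts.items():
        row = [counter[c] for c in union]
        if binary:
            # Binary option: existence (1) or absence (0) of the concept.
            row = [1 if v > 0 else 0 for v in row]
        matrix[name] = row

    stats = {}
    for c in union:
        total = sum(counter[c] for counter in counts.values())
        n_texts = sum(1 for counter in counts.values() if counter[c] > 0)
        percent = 100.0 * n_texts / len(counts)
        stats[c] = (total, n_texts, percent)
    return union, matrix, stats


if __name__ == "__main__":
    sample = {
        "Our Text I": ["dog", "ate", "dog"],
        "Our Text II": ["dog", "roses"],
    }
    print(term_distribution(sample))
```

Each matrix row corresponds to one text and each column to one concept of the union, matching the layout described for the matrices above.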

1.2  Example  for  Term  Distribution  List  and  Matrices 

Extracts from the Our Text I.txt and Our Text II.txt files were used as input:

Our Text I.txt: Mr. Cray's brown dog ate the lotus blossom at 10 am. Mrs. Brown
was  unhappy  with  the  dog.  She  yelled  at  it  saying  "You  impossible  dog!"  But  the  dog 
kept  eating  the  flowers  and  weeds.  She  asked  Mr.  Cray  to  stop  the  dog.  He  couldn't. 
Mrs.  Brown  planted  roses  and  weeded  the  garden.  The  silly  dog  %  dug  up  the  roses 
looking  for  a  vole  on  June  12,  1880.  Weeding  was  no  longer  needed.  Prof.  Darren, 
Mrs.  Brown  &  Mr.  Cray  met  the  next  day  to  concoct  a  plan.  John  Darren  and  Mrs. 
Brown  put  up  a  scarecrow.  She  thought  it  would  scare  the  dog.  Mr.  Craye  put  up  a 
fence.  Problem  solved.  Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils,  and 
roses. 

Our Text II.txt: Mr. Cray's brown dog stopped eating the lotus blossom at 12 pm the
next  day.  Mrs.  Brown  was  now  happy  with  the  dog.  She  said  "You  good  dog!"  The 
dog  no  longer  ate  the  flowers  and  weeds.  Mr.  Cray  was  pleased  too.  Mrs.  Brown 
watered  the  roses  and  fertilized  the  garden  on  June  13,  1880.  Prof.  Darren,  Mrs. 
Brown  &  Mr.  Cray  met  over  dinner  and  discussed  how  the  plan  had  worked.  John 
Darren  and  Mrs.  Brown  would  take  down  the  scarecrow  the  following  week.  She 
thought  it  was  too  scary  for  the  dog.  Mr.  Craye  painted  his  fence.  Then  Mrs.  Brown 
watered  lotus,  carnations,  daffodils,  and  roses. 

Then AutoMap's Extensive Delete List with Direct Adjacency was applied to both texts. The
Delete List was extended by further non-content-bearing words that appeared in the
sample texts (a an and as at awhile but for from happening he her her hers him his i in
into it its me mine my nor of or our she so that the their theirs them they to us was we
were what who whoever whom whomever will would you your yours yourself). Below are
the resulting texts:

Our Text I.txt: Mr. Cray's brown dog ate lotus blossom 10 am. Mrs. Brown was
unhappy  with  dog.  yelled  saying  "You  impossible  dog!"  dog  kept  eating  flowers 
weeds,  asked  Mr.  Cray  stop  dog.  couldn't.  Mrs.  Brown  planted  roses  weeded  garden, 
silly  dog  %  dug  up  roses  looking  vole  on  June  12,  1880.  Weeding  was  no  longer 
needed.  Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  next  day  concoct  plan.  John  Darren 
Mrs.  Brown  put  up  scarecrow,  thought  scare  dog.  Mr.  Craye  put  up  fence.  Problem 
solved.  Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils,  roses. 


Our Text II.txt: Mr. Cray's brown dog stopped eating lotus blossom 12 pm next day.
Mrs.  Brown  was  now  happy  with  dog.  said  "You  good  dog!"  dog  no  longer  ate  flowers 
weeds.  Mr.  Cray  was  pleased  too.  Mrs.  Brown  watered  roses  fertilized  garden  on 
June  13,  1880.  Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  over  dinner  discussed  how 
plan  had  worked.  John  Darren  Mrs.  Brown  take  down  scarecrow  following  week, 
thought  was  too  scary  dog.  Mr.  Craye  painted  fence.  Then  Mrs.  Brown  watered  lotus, 
carnations,  daffodils,  roses. 
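The effect of applying a delete list can be sketched in a few lines of Python. This is an illustration only, assuming that direct adjacency means deleted words are dropped so that the remaining words become adjacent, as the resulting texts above suggest. The function name apply_delete_list and the tokenization on whitespace are assumptions, not AutoMap's actual procedure.

```python
import re


def apply_delete_list(text, delete_list):
    """Drop every token whose bare word appears in the delete list.

    Remaining tokens are joined directly, so neighbours of a deleted
    word become adjacent (assumed meaning of direct adjacency).
    """
    delete = {word.lower() for word in delete_list}
    kept = []
    for token in text.split():
        # Strip leading and trailing punctuation before the lookup.
        bare = re.sub(r"^\W+|\W+$", "", token).lower()
        if bare not in delete:
            kept.append(token)
    return " ".join(kept)


if __name__ == "__main__":
    print(apply_delete_list("She yelled at it saying", ["she", "at", "it"]))
```

Note that in this sketch punctuation attached to a deleted token is removed together with the token.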

Then  semantic  network  analysis  was  run  on  both  texts  using  AutoMap's  default  settings. 
The  black  ellipse  in  the  screen  shot  below  shows  how  to  run  a  Single  Map  Analysis 
(Semantic  Network  Analysis): 


[Screen shot: menu listing Single Map Analysis, Single Meta Matrix Text Analysis, Multiple Meta Matrix Text Analysis, Single Sub Matrix Text Analysis, and Multiple Sub Matrix Text Analysis, with Single Map Analysis circled.]

All outputs provided on the Output Options panel were checked. Again, the default
settings were used. All output lists are saved in this folder
(Term_Distribution_Lists_and_Matrices.zip).

2.  Save Non-Identified concepts

Purpose: Save a list of all concepts that remain in the pre-processed texts and that
are not:

•  Denoted in a delete list

•  Denoted in any of the thesauri

To create a list of these words, go to P2, tab no. 4 - Analysis Settings, tab no. 2
- Output Options index card, select the Non-Identified Concepts field, and check Save list
of Non-Identified Concepts. Run any type of analysis. The list will be stored under the
root directory of AutoMap as Non_identified_concepts.csv. The black ellipse in the
screen shot below shows where to find this option.

[Screen shot: P2, 4. Analysis Settings, 2. Output Options index card, showing the fields Additional Output Options per File (Semantic Network (as .csv), Non-Identified Concepts), Additional Output Formats (UciNet (DL), DyNetML for Map Analysis, DyNetML for Sub-Matrix Analysis), Network analytic measures (Measures), and Sub Matrix Analysis Options.]
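The idea behind the list of non-identified concepts can be sketched as a set difference. The Python below is a hedged illustration, not AutoMap's code: the function name non_identified_concepts, the representation of a thesaurus as a dict from text-level concept to higher-level concept, and the case-insensitive comparison are all assumptions.

```python
def non_identified_concepts(preprocessed_texts, delete_list, thesauri):
    """Return concepts left in the texts that no delete list or thesaurus mentions.

    preprocessed_texts: list of concept lists, one per text (assumed shape).
    delete_list: iterable of words.
    thesauri: list of dicts mapping a concept to its replacement (assumed shape).
    """
    known = {word.lower() for word in delete_list}
    for thesaurus in thesauri:
        # Treat both sides of each thesaurus entry as identified.
        known.update(key.lower() for key in thesaurus)
        known.update(value.lower() for value in thesaurus.values())
    remaining = {c.lower() for text in preprocessed_texts for c in text}
    return sorted(remaining - known)


if __name__ == "__main__":
    texts = [["dog", "ate", "lotus"], ["dog", "agent", "roses"]]
    print(non_identified_concepts(texts, ["the"], [{"lotus": "flower"}]))
```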


3.  Additional  Output  Formats 

Two  additional  data  formats  are  offered: 

•  DL  for  e.g.  UCINET  (Borgatti,  Everett  &  Freeman,  2002;  for  more  information 
about  UCINET  see  http://www.analytictech.com/ucinet.htm). 

•  DyNetML  (Tsvetovat,  Reminga  &  Carley,  2004;  for  more  information  about 
DyNetML  see  http://www.casos.cs.cmu.edu/projects/dynetml/). 

Note: You can also use the Network Converter to convert data.

3.1  DL: 

If maps and/or term distribution matrices are generated, these files can additionally be
stored in the UCINET DL format.

Some points to consider when storing in UCINET DL format:

•  To do this, check the "Maps" and/or "Term distribution matrices" items in
the "Additional Output Formats" field on P2, tab no. 10 Output Options index
card.

•  The  maps  are  stored  with  the  extension  .dl  in  the  "DL_UciNet_Format"  folder 
in  the  root  directory  of  AutoMap. 

•  The  term  distribution  matrices  are  stored  with  the  extension  .dl  in  the 
"DL_UciNet_Format"  folder  in  the  root  directory  of  AutoMap. 

•  If you wish to perform UCINET analysis on your AutoMap results, import the
data into UCINET. If you wish to visualize your AutoMap results, import the
data into UCINET and use the visualization tools provided through UCINET.

Note: Since networks extracted with AutoMap are directed, matrices
representing these networks can be rectangular. If a DL file needs to
be generated from a rectangular matrix, AutoMap by default squares
this matrix before converting it into DL format.
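To make the squaring step concrete, here is a minimal Python sketch. It assumes that "squaring" means padding the matrix so that rows and columns share the union of row and column labels, and it writes a full-matrix DL layout (dl n=, format=fullmatrix, labels:, data:). Both the function names and the exact layout AutoMap writes are assumptions. Labels are assumed to contain no spaces or commas.

```python
def square_matrix(rows, cols, cells):
    """Pad a rectangular matrix to a square one over the union of labels.

    rows, cols: lists of labels; cells: dict {(row, col): value}.
    Missing cells are filled with 0.
    """
    labels = list(rows) + [c for c in cols if c not in rows]
    index = {label: i for i, label in enumerate(labels)}
    n = len(labels)
    matrix = [[0] * n for _ in range(n)]
    for (r, c), value in cells.items():
        matrix[index[r]][index[c]] = value
    return labels, matrix


def to_dl(labels, matrix):
    """Serialize a square matrix in a full-matrix DL layout (assumed layout)."""
    lines = [
        f"dl n={len(labels)} format=fullmatrix",
        "labels:",
        ",".join(labels),
        "data:",
    ]
    lines += [" ".join(str(v) for v in row) for row in matrix]
    return "\n".join(lines)


if __name__ == "__main__":
    labels, matrix = square_matrix(
        ["dog", "brown"], ["ate", "dog"],
        {("dog", "ate"): 1, ("brown", "dog"): 2},
    )
    print(to_dl(labels, matrix))
```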

3.2  DyNetML: 

Results  of  Map  Analysis  and  Sub-Matrix  Text  Analysis  can  be  output  in  DyNetML 
format. 

Purpose:  Generate  DyNetML  representation  of  maps  (mental  models)  generated  with 
Map  Analysis. 

How to: Check the "per Map" checkbox on P2, 10. Output Options index card,
Additional Output Formats, DyNetML for Map Analysis.

Output:  The  resulting  DyNetML  files  will  be  stored  as  NameOfText.xml  in  the  xml 
folder  under  the  root  directory  of  AutoMap. 

After  Map  Analysis: 

By default, all entities in the applied ontology are treated as nodes, and all
statements between entities in the applied ontology (in anterior position, posterior
position, or both) are represented as edges in DyNetML. The user can exclude
entities of any applied ontology from being treated as nodes and thus from forming
statements. Excluded entities are instead treated as attributes of the other entities in
the ontology that do form nodes and link into statements. An example is the category
"attribute", which represents information that is inherent to a certain entity. The
concepts "teacher" or "male", for example, might be considered attributes that relate to
the entity agent. To use ontologies that represent not only entities but also features of
entities, follow this procedure:

1.  Use the drag-and-drop field labeled Concepts not forming edges at the bottom of
P2, 10. Output Options index card. Click Refresh to automatically generate a
list of all entities in the applied ontology.

2.  Use  the  >  and  <  buttons  to  move  entities  from  one  window  to  the  other. 

3.  Apply  your  decision  by  hitting  the  Apply  Selection  button. 

4.  Before  running  Sub-Matrix  Text  Analysis  decide  whether  you  want  to  perform 
"Network  Text  Analysis"  (NTA)  or  "Social  Network  Text  Analysis"  (SNTA).  To 
select  a  type  use  the  toggle  button  at  the  bottom  of  P2,  10.  Output  Options 
index  card.  Both  types  are  a  form  of  Sub-Matrix  Text  Analysis,  but  differ  in 
what  they  measure: 

1.  NTA: If one measures a textual network, then entities of the applied
ontology that should not be considered as nodes, but as inherent
information of nodes, still fill a structural position and a semantic
function in a text. They thus contribute to a text's density and should
therefore be taken into consideration as statements for maps and stats. The
resulting DyNetML file will contain nodes and inherent information on
nodes (if any is contained in the window in which an entity and an attribute
co-occur).

2.  SNTA: If one measures a social network that is represented in or
extracted from a text, then links from entities of the applied ontology
that should not be considered as nodes, but as inherent information of
nodes, to actual other nodes are not to be represented in the maps
and stats, since this would overfit the social network. The resulting
DyNetML file will NOT contain inherent information on nodes.

Note:  Whether  to  use  NTA  or  SNTA  is  a  "text-philosophical  question" 
and  the  answer  depends  upon  what  the  user  wants  to  measure:  a 
textual  network  or  a  social  network. 
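The difference between the two modes can be sketched as a filter over statements. The Python below is an illustrative reading of the description above, not AutoMap's implementation: the function name select_statements, the representation of a statement as an (anterior, posterior) pair, and the set of attribute concepts are assumptions.

```python
def select_statements(statements, attribute_concepts, mode="NTA"):
    """Choose which statements enter maps and stats.

    statements: list of (anterior, posterior) concept pairs.
    attribute_concepts: concepts treated as inherent information of nodes.
    mode: "NTA" keeps every statement; "SNTA" drops statements that
          involve an attribute concept.
    """
    if mode == "NTA":
        return list(statements)
    if mode == "SNTA":
        return [
            (a, p)
            for a, p in statements
            if a not in attribute_concepts and p not in attribute_concepts
        ]
    raise ValueError("mode must be 'NTA' or 'SNTA'")


if __name__ == "__main__":
    links = [("brown", "teacher"), ("brown", "cray"), ("male", "cray")]
    print(select_statements(links, {"teacher", "male"}, mode="SNTA"))
```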

Run  Sub-Matrix  Text  Analysis. 

Two  options  for  creating  DyNetML  files  are  offered.  Either  one  or  both  options  can  be 
selected  per  analysis: 

1.  Create  one  DyNetML  file  per  map  and  text.  To  do  this,  check  per  Text  in  the 
Additional  Output  Formats  field  on  P2,  10.  Output  Options  index  card. 

The  DyNetML  files  will  be  stored  as  NameOfText.xml  and  in  a  folder  called  xml 
under  the  root  directory  of  AutoMap. 

2.  Create  one  DyNetML  file  that  unifies  all  maps.  To  do  this,  check  per  TextSet  in 
the  Additional  Output  Formats  field  on  P2,  10.  Output  Options  index  card. 

The  DyNetML  file  will  be  stored  as  consolidated_map.xml  in  a  folder  called 
xml  under  the  root  directory  of  AutoMap. 

3.3  Examples  for  additional  Output  Formats 

Extracts from the Our Text I.txt and Our Text II.txt files were used as input:

Our Text I.txt: Mr. Cray's brown dog ate the lotus blossom at 10 am. Mrs. Brown
was unhappy with the dog. She yelled at it saying "You impossible dog!" But the dog
kept eating the flowers and weeds. She asked Mr. Cray to stop the dog. He couldn't.
Mrs.  Brown  planted  roses  and  weeded  the  garden.  The  silly  dog  %  dug  up  the  roses 
looking  for  a  vole  on  June  12,  1880.  Weeding  was  no  longer  needed.  Prof.  Darren, 
Mrs.  Brown  &  Mr.  Cray  met  the  next  day  to  concoct  a  plan.  John  Darren  and  Mrs. 
Brown  put  up  a  scarecrow.  She  thought  it  would  scare  the  dog.  Mr.  Craye  put  up  a 
fence.  Problem  solved.  Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils,  and 
roses. 

Our Text II.txt: Mr. Cray's brown dog stopped eating the lotus blossom at 12 pm the
next day. Mrs. Brown was now happy with the dog. She said "You good dog!" The
dog no longer ate the flowers and weeds. Mr. Cray was pleased too. Mrs. Brown
watered the roses and fertilized the garden on June 13, 1880. Prof. Darren, Mrs.
Brown & Mr. Cray met over dinner and discussed how the plan had worked. John
Darren and Mrs. Brown would take down the scarecrow the following week. She
thought it was too scary for the dog. Mr. Craye painted his fence. Then Mrs. Brown
watered lotus, carnations, daffodils, and roses.

Then AutoMap's Extensive Delete List with Direct Adjacency was applied to both texts. The
Delete List was extended by further non-content-bearing words that appeared in the
sample texts (a an and as at awhile but for from happening he her her hers him his i in
into it its me mine my nor of or our she so that the their theirs them they to us was we
were what who whoever whom whomever will would you your yours yourself).

Resulting  texts: 

Our  Text  I.txt:  Mr.  Cray's  brown  dog  ate  lotus  blossom  10  am.  Mrs.  Brown  unhappy 
with  dog.  yelled  saying  "You  impossible  dog!"  dog  kept  eating  flowers  weeds,  asked 
Mr.  Cray  stop  dog.  couldn't.  Mrs.  Brown  planted  roses  weeded  garden,  silly  dog  % 
dug  up  roses  looking  vole  on  June  12,  1880.  Weeding  no  longer  needed. 

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  next  day  concoct  plan.  John  Darren  Mrs. 
Brown  put  up  scarecrow,  thought  scare  dog.  Mr.  Craye  put  up  fence.  Problem  solved. 
Then  Mrs.  Brown  planted  lotus,  carnations,  daffodils,  roses. 

Our Text II.txt: Mr. Cray's brown dog stopped eating lotus blossom 12 pm next day.
Mrs.  Brown  now  happy  with  dog.  said  "You  good  dog!"  dog  no  longer  ate  flowers 
weeds.  Mr.  Cray  pleased  too. 

Mrs.  Brown  watered  roses  fertilized  garden  on  June  13,  1880. 

Prof.  Darren,  Mrs.  Brown  &  Mr.  Cray  met  over  dinner  discussed  how  plan  had 
worked.  John  Darren  Mrs.  Brown  take  down  scarecrow  following  week,  thought  too 
scary  dog.  Mr.  Craye  painted  fence.  Then  Mrs.  Brown  watered  lotus,  carnations, 
daffodils,  roses. 

Then map analysis was run on both texts using AutoMap's default settings. All term
distribution matrices provided on the Output Options panel were checked. Again, the
default settings were used. The screen shot below shows the resulting analysis that should
be displayed in P3.

4.  Network Analytic Measures

AutoMap  supports  the  computation  of  network  analytic  measures  per  map  or  network  and 
per  concept  or  node. 

Note:  Only  Network  Analytic  Measures  for  directed  networks  were 
implemented  into  AutoMap.  The  reason  for  this  is  that  AutoMap 
outputs  are  always  directed  in  order  to  adequately  represent  the 
linear  structure  of  texts. 

To  create  measures  follow  these  steps: 

1.  On  P2,  tab  no.  10  Output  Options,  select  the  Network  analytic  measures  field 
and  check  Measures. 

2.  After  analysis  is  run,  measures  will  be  displayed  on  P4,  2.  Network  analytic 
measures  index  card. 

A  file  that  collects  all  measures  per  text  set  will  be  saved  as 
overall_network_analytix_measures.csv  in  the  results  folder  under  the  root 
directory  of  AutoMap. 

The following table explains the measures that can be computed. For each measure it
lists the calculation, the name of and reference for the corresponding SNA measure, and
the corresponding name of the measure in Cube.

Concept (node) level measures, direct connectivity

Concept outdegree
   Calculation: Total number of statements with concept in anterior position
   SNA measure: Outdegree, expansiveness, actor degree centrality (Wasserman & Faust 1994); Prestige, Influence (Mrvar)
   Name in Cube: Local imageability

Concept indegree
   Calculation: Total number of statements with concept in posterior position
   SNA measure: Indegree, receptivity, popularity, actor degree prestige (Wasserman & Faust 1994); Prestige, Support (Mrvar)
   Name in Cube: Local evokability

Concept outdegree centrality
   Calculation: Total number of statements with concept in anterior position / Number of unique concepts in text
   SNA measure: Outdegree Centrality (Wasserman & Faust 1994); Relative Influence (Mrvar)
   Name in Cube: None

Concept indegree centrality
   Calculation: Total number of statements with concept in posterior position / Number of unique concepts per text
   SNA measure: Indegree Centrality (Wasserman & Faust 1994); Relative Support (Mrvar)
   Name in Cube: None

Total degree
   Calculation: Concept indegree + concept outdegree
   Name in Cube: Local density

Map (graph) level measures, direct connectivity

Text outdegree centrality
   Calculation: Sum (largest observed outdegree - outdegree of concepts) / (Number of unique concepts in text)^2
   SNA measure: Group outdegree centralization (Wasserman & Faust 1994)
   Name in Cube: None

Mean concept outdegree centrality
   Calculation: Sum (outdegree) / Number of unique concepts in text
   SNA measure: Mean outdegree (= Mean indegree) (Wasserman & Faust 1994)
   Name in Cube: None

Variance of concept outdegree centrality
   Calculation: Sum (sum outdegree - mean outdegree)^2 / Number of unique concepts in text
   SNA measure: Variance of outdegree (Wasserman & Faust 1994, p. 127-128)
   Name in Cube: None

Text indegree centrality
   Calculation: Sum (largest observed indegree - indegree of concepts) / (Number of unique concepts in text)^2
   SNA measure: Group indegree centralization (Wasserman & Faust 1994)
   Name in Cube: None

Mean concept indegree centrality
   Calculation: Sum (indegree) / Number of unique concepts in text
   SNA measure: Mean indegree (= Mean outdegree)
   Name in Cube: None

Variance of concept indegree centrality
   Calculation: Sum (sum indegree - mean indegree)^2 / Number of unique concepts in text
   SNA measure: Variance of indegree (Wasserman & Faust 1994, p. 127-128)
   Name in Cube: None

Density
   Calculation: Number of statements / Possible number of statements
   SNA measure: Density (Wasserman & Faust 1994, p. 129; Scott 1991, p. 74). Wasserman and Faust use g(g-1) as the denominator; we use (g*g), because a unique concept can form a statement with the same unique concept (e.g. agent-agent).
   Name in Cube: None

Concept (node) level measures, indirect connectivity

Concept closeness centrality
   Calculation: Minimum possible total distance from node i to all other nodes / Sum of all geodesics between node i and all other nodes
   SNA measure: Closeness (Wasserman & Faust, Mrvar). According to Wasserman and Faust (1994, p. 200), group level closeness centrality is not computed.
   Name in Cube: None

Concept betweenness centrality
   Calculation: Sum ((Number of all geodesics between all nodes that go through node i) / (Number of geodesics between node i and all other nodes)) / ((Number of unique concepts in text - 1)(Number of unique concepts in text - 2))
   SNA measure: Betweenness (Gould 1987, Mrvar)
   Name in Cube: None

Concept proximity prestige
   Calculation: Number of concepts directly or indirectly adjacent to node i
   SNA measure: Proximity Prestige (Wasserman & Faust 1994, Mrvar)
   Name in Cube: None

Map (graph) level measures, indirect connectivity

Text Proximity Prestige
   Calculation: Sum (Proximity Prestige (all unique concepts in text)) / Number of unique concepts in text
   SNA measure: Group level proximity prestige (Wasserman & Faust 1994)
   Name in Cube: None
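The direct-connectivity measures listed above can be sketched in Python. This is an illustration under stated assumptions rather than AutoMap's code: a statement is taken to be an (anterior, posterior) pair, repeated statements are counted once, and the function name direct_connectivity is hypothetical. Density uses g*g as the denominator, as described above.

```python
from collections import Counter


def direct_connectivity(statements):
    """Compute per-concept degree measures and map density.

    statements: iterable of (anterior, posterior) concept pairs.
    Repeated statements are counted once (an assumption of this sketch).
    """
    distinct = set(statements)
    concepts = sorted({c for pair in distinct for c in pair})
    g = len(concepts)  # number of unique concepts in the text
    outdegree = Counter(anterior for anterior, _ in distinct)
    indegree = Counter(posterior for _, posterior in distinct)

    per_concept = {}
    for c in concepts:
        per_concept[c] = {
            "outdegree": outdegree[c],
            "indegree": indegree[c],
            "outdegree_centrality": outdegree[c] / g,
            "indegree_centrality": indegree[c] / g,
            "total_degree": indegree[c] + outdegree[c],
        }
    # g*g possible statements, because a concept may link to itself.
    density = len(distinct) / (g * g) if g else 0.0
    return per_concept, density


if __name__ == "__main__":
    sample = [("dog", "ate"), ("ate", "lotus"), ("dog", "dug")]
    print(direct_connectivity(sample))
```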

Entropy 

Entropy formalizes redundancy and diversity. The Information Entropy H of a text
document X is given by the formula below, where the probability p(x) of a word x is the
ratio of the total frequency of x to the length (total number of words) of the text document:

H(X) = -\sum_{x \in X} p(x) \log p(x)
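As an illustration of the entropy definition above, the following Python sketch computes the entropy of a single text. It is not AutoMap's implementation: the function name word_entropy, the whitespace tokenization, the lowercasing, and the base-2 logarithm are assumptions, since the base of the logarithm is not stated here.

```python
import math
from collections import Counter


def word_entropy(text):
    """Shannon entropy of the word distribution of a text, in bits.

    p(x) is the frequency of word x divided by the total number of words.
    Base 2 is an assumption of this sketch.
    """
    words = text.lower().split()
    if not words:
        return 0.0
    total = len(words)
    counts = Counter(words)
    entropy = 0.0
    for count in counts.values():
        p = count / total
        entropy -= p * math.log2(p)
    return entropy


if __name__ == "__main__":
    print(word_entropy("dog ate lotus dog dug roses"))
```

A text that repeats one word has entropy 0, while a text in which every word is distinct has the maximum entropy for its length.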


To  run  Entropy  you  will  need  to  load  multiple  text  documents  into  AutoMap  for  analysis. 

File  >  Open  Multiple  Files  >  Select  Folder. 

Navigate  to  the  folder  containing  the  texts  you  wish  to  analyze,  and  single  click  on  it. 

In the menu bar, select the option Run Analysis.

Select the Multiple Map Analysis option as shown in the diagram below:

[Screen shot: analysis menu listing Single Meta Matrix Text Analysis, Multiple Meta Matrix Text Analysis, Single Sub Matrix Text Analysis, and Multiple Sub Matrix Text Analysis.]

Once you have selected Multiple Map Analysis, go to Panel 2, tab no. 4. Analysis
Settings, then select tab no. 2. Output Options.

The entropy values are written to the last column of the stats_overall.csv output file,
which can be opened in Excel.

Reference 

Theory: 

W.  Weaver  &  C.  E.  Shannon,  The  Mathematical  Theory  of  Communication,  Urbana, 
Illinois:  University  of  Illinois  Press,  1949 

Empiric  application: 

Diesner,  J.,  Carley,  K.M.,  &  Katzmair,  H.,  The  Morphology  of  a  Breakdown:  How  the 
semantics  and  mechanics  of  communication  networks  from  an  organization  in  crises 
relate. XXVII Sunbelt Social Network Conference, Corfu, Greece, May 2007.

DyNetML

Changes  in  DyNetML  outputs  for  XML  files  per  Text 

The  DyNetML  outputs  per  Map  (Semantic  Network  Analysis)  and  per  Text  (Sub-Matrix 
Analysis)  have  been  changed  to  reflect: 

•  The  new  DyNetML  specification 
(http://casos.isn.cmu.edu/dynetml/index.html) 

•  Isolated  entities 

•  Frequency  of  entities 

•  A  set  of  attributes  on  the  data  and  statistical  results 

This change does not impact the consolidated DyNetML files as produced in AutoMap.

CASOS Email Parser (CEMAP)


1.  Instructions for using the CASOS Email Parser (CEMAP)
through AutoMap

Introduction

The  CASOS  Email  Parser  (CEMAP)  that  is  launched  through  AutoMap 
(http://www.casos.cs.cmu.edu/projects/automap/software.html)  enables  the  extraction  of 
different  types  of  network  information  from  emails  (e.g.  who  exchanges  information, 
who  provides  what  information,  etc.).  The  following  image  shows  what  types  of 
information  can  be  extracted  with  CEMAP. 


Network type 1 (social network, SN) represents social network data that can be extracted
from email headers (e.g. agent-agent networks). In SN, entities represent people, and edges
represent exchanged emails (frequency count). This network type does not require any
text coding in AutoMap.
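A network of type 1 can be illustrated with Python's standard email module. The sketch below is not CEMAP's implementation: the function name header_network is hypothetical, and the choice to count one edge per sender and recipient pair, using the From, To, and Cc headers, is an assumption about what "exchanged emails (frequency count)" covers.

```python
from collections import Counter
from email import message_from_string
from email.utils import getaddresses


def header_network(raw_messages):
    """Build agent-by-agent edge counts from raw email texts.

    Returns a Counter mapping (sender, recipient) address pairs to the
    number of emails sent from sender to recipient.
    """
    edges = Counter()
    for raw in raw_messages:
        msg = message_from_string(raw)
        senders = [addr for _, addr in getaddresses(msg.get_all("From", []))]
        recipient_headers = msg.get_all("To", []) + msg.get_all("Cc", [])
        recipients = [addr for _, addr in getaddresses(recipient_headers)]
        for sender in senders:
            for recipient in recipients:
                edges[(sender.lower(), recipient.lower())] += 1
    return edges


if __name__ == "__main__":
    sample = (
        "From: Ann <ann@example.org>\n"
        "To: bob@example.org, Cy <cy@example.org>\n"
        "Subject: hi\n"
        "\n"
        "body"
    )
    print(header_network([sample]))
```

Only header fields are read, so no text coding of the email bodies is involved.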

Network types 2 and 3 are extracted by performing text coding in AutoMap. More
specifically, knowledge networks (KK) represent semantic networks or mental models
that are contained in the bodies of individual emails. In KK, entities represent knowledge
items, and edges represent the co-occurrence of terms in text. For MM, texts are coded in
AutoMap according to a taxonomy or ontology (e.g. the meta-matrix; other ontologies
can be specified by the user). In MM, entities represent instances of categories (e.g.
agent, knowledge, resources) of the ontology, and edges represent co-occurrences of
terms in texts.

Network types 4 and 5 result from the combination of SN with KK and MM,
respectively. In type 4, entities represent people and knowledge, and edges represent
emails and mental models. In type 5, nodes represent the categories of the taxonomy as
specified by the user, and the edges represent the co-occurrence of the terms that
represent instances of the taxonomy in the corpus. For the creation of types 4 and 5, the
extraction of types 2 and 3, respectively, is mandatory.

CEMAP stores all network data as DyNetML files (a derivative of XML). This data can be
analyzed with any package that reads DyNetML. We recommend using the ORA
software for this purpose (http://www.casos.cs.cmu.edu/projects/ora/index.html).

How  to  do  it: 

1.  Extraction  of  network  type  1  data: 

•  1.1.  This  network  type  is  needed  to  perform  social  network  analysis  (SNA) 
based  on  header  information  only. 

•  1.2.  In  CEMAP,  Analysis  Type,  select  the  type  of  your  mail  box. 

•  1.3.  In CEMAP, Input Data Set, specify your email client, inbox type,
username, and password as semicolon-separated values, e.g.

pop.gmail.com;INBOX;John Doe;JohnDoesPassword

•  1.4.  In CEMAP, Enter Destination Directory for Texts, put in a directory of
your choice, e.g. C:\textInput. The email bodies will be stored in AutoMap
format (.txt) under this directory. Make sure that this directory exists on
your machine.

•  1.5.  In  CEMAP,  Enter  Destination  File  for  Network,  put  in  a  directory  AND 
filename  of  your  choice.  The  filename  extension  needs  to  be  .xml.  e.g. 
C:\output\AA.xml. 

•  1.6.  You  are  now  ready  to  extract  network  type  1.  In  order  to  do  so,  press  the 
Start  Extraction  button.  As  a  result,  CEMAP  will  extract  network  data  from  the 
email  headers  (e.g.  agent  by  agent)  and  store  it  as  one  XML  file  under  the 
directory  that  is  specified  in  the  Destination  File  for  Network. 

•  1.7.  You  can  pick  up  the  output  file  and  load  it  into  the  SNA  package  of  your 
choice.  We  recommend  using  the  ORA  package 
(http://www.casos.cs.cmu.edu/projects/ora/download.php). 

To  extract  networks  of  type  2,  perform  Semantic  Network  Analysis  in  AutoMap. 

To  extract  networks  of  type  3,  perform  Meta-Matrix  Text  Analysis  in  AutoMap. 

To extract networks of type 4, merge networks of types 1 and 2.

To extract networks of type 5, merge networks of types 1 and 3.

CEMAP  is  a  feature  in  AutoMap  that  will  import  email  files  into  AutoMap  to  analyze  the 
text  in  the  emails,  such  as  who  exchanges  information,  who  provides  what  information, 
etc. 

To  use  the  feature: 

•  In the AutoMap menu bar, select Tools > Open CASOS Email Parser (CEMAP).

[Screen shot: the Tools menu, which lists Open CASOS Email Parser (CEMAP) among its entries.]


Once  that  is  selected  the  CASOS  —  Email  Analyzer  window  will  appear.  The 
Email  Analyzer  window  is  where  you  will  type  in  your  information  of  what  type 
of  network  you  would  like  to  create  while  importing  your  emails. 

The  different  types  of  imports  are 

o  Full  email 

o  Header  information  only 
o  Forwarded/ New  emails 
o  Subject  lines  only 

Select the format your email is in from the drop-down menu.

o  MBOX: File that is stored in standard file folders
o  POP3 and IMAP: Emails that are on a server

■  Will require Username, User password, Server Name, and
Folder Name

o  TXT: Exported email files from Outlook

■  Will require a Data (file) set identifier

o  Enron and CasosDB are for internal use only
o  DyNetML: Currently under development

Once you have selected the format you would like to use, click on the
Start Extraction button. This will begin the process of importing your emails.

Support

How  to  cite  AutoMap 

Carley,  K.M.,  &  Diesner,  J.  (2005).  AutoMap:  Software  for  Network  Text  Analysis. 

Further  Information 

For  further  information  on  AutoMap  please  visit: 
http://www.casos.cs.cmu.edu/projects/automap

On  this  web  page  you  will  find: 

•  Publications  about  AutoMap,  Network  Text  Analysis,  and  Map  Analysis 

•  Installers  for  AutoMap 

•  Contact  information 

•  Information  about  sponsors  of  AutoMap 

•  Information  about  people  working  on  AutoMap 

We also provide an online discussion forum for AutoMap:

•  to  discuss  questions  related  to  the  AutoMap  software; 

•  and  get  help  from  other  AutoMap  users  and  the  developers  of  the  software 
with  using  the  tool. 

Questions,  Bugs,  and  Comments 

Please  contact  us: 

Dr.  Kathleen  M.  Carley  (kathleen.carley@cmu.edu) 

Jana  Diesner  (diesner@cs.cmu.edu) 

Technical  Writer,  Matt  De  Reno  (mjdereno@andrew.cmu.edu) 

Carnegie  Mellon  University 
School  of  Computer  Science 
Institute  for  Software  Research  International  (ISRI) 

Center  for  Computational  Analysis  of  Social  and  Organizational  Systems  (CASOS) 
5000  Forbes  Avenue 
1325 Wean Hall
Pittsburgh, PA, 15213

Lesson 1

From  content  analysis  to  semantic  networks 
i.  Open  AutoMap 
On  Empire: 

•  Navigate  to  C:\Documents  and  Settings\Carley\Desktop\Tools. 

•  Double-click  the  Run  Automap  icon.  The  Automap  Graphical  User  Interface 
(GUI)  pops  up. 

From  anywhere  else:  (requires  a  web  connection) 
http://www.casos.cs.cmu.edu/projects/automap/software.html 

II.  Overview  of  the  AutoMap  Graphical  User  Interface  (GUI) 

Below  is  a  screen  capture  of  the  AutoMap  GUI: 


[Screen capture: the AutoMap GUI, with callouts for the Menus, the Browse Menu, the Index
Cards, and a Panel]

The AutoMap GUI is divided into four primary quadrants, or panels, as shown above: P1
(top left), P2 (bottom left), P3 (top right), and P4 (bottom right).
A drop-down menu bar at the top of the window provides access to analysis tools and
utilities. The Browse Menu allows you to quickly navigate between texts you have loaded
into AutoMap.

Panel P2 can be edited, but the remaining panels (P1, P3, P4) cannot. The information
displayed in each panel is always related to the information displayed in the other
three panels. The Browse Menu, however, relates to all four panels at the same time.

III.  Load  multiple  text  documents  into  AutoMap 

In  order  to  analyze  multiple  pieces  of  text  simultaneously,  you  will  need  to  save  the 
documents  together  in  one  folder. 

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\TextInputFiles

•  In  the  AutoMap  menu  bar,  select  File  >  Open  Multiple  Files.  A  folder  selection 
window  will  pop  up. 

•  Navigate  to  the  folder  containing  the  documents  you  wish  to  analyze,  and 
single-click  on  it. 

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\TextInputFiles

•  Click  the  Open  button  in  the  bottom  right  corner. 

The first text will be displayed in panel P1 under the tab titled 1. Original Texts. You can
use the Browse Menu to navigate through the texts.

Below  is  a  screen  capture  of  the  first  text  display: 
[Screen capture: the first text displayed in panel P1. Callout: The Browse Menu allows you
to quickly navigate from one text sample to the next.]


IV.  Carry  out  a  simple  content  analysis 

A  simple  content  analysis  determines  the  frequencies  of  all  words  in  a  text  document. 

Examine  the  Concept  List: 

The  Concept  List  considers  concepts  in  each  text  file  individually.  It  is  created 
automatically  when  a  document  or  set  of  documents  is  loaded  into  Automap.  This  will  be 
displayed  in  the  P2  panel  under  the  tab  titled  1.  Concept  List. 

The  Concept  List  will  tell  you  several  important  facts  about  each  document  in  your  set: 

• The number of concepts found in the text displayed in P1 under the tab titled
1. Original Texts

• Related frequencies of those concepts

• Cumulated unique concepts and total concepts contained in the data set

Note: The number of unique concepts considers each
concept only once. The number of total concepts
considers repetitions of concepts.

By default, the Concept List sorts by decreasing frequency of concepts. To
sort the list alphabetically, click on the first-column header Concept. To
re-sort the list by frequency, click on the second-column header Frequency.
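
The counting behind a Concept List can be approximated outside AutoMap. The following is a minimal sketch, not AutoMap's implementation: the tokenizer (lower-casing and keeping runs of letters and apostrophes) and the function name concept_list are assumptions made for illustration.

```python
import re
from collections import Counter

def concept_list(text):
    """Return (frequency counter, unique count, total count) for one text."""
    # Assumption: a concept is a lower-cased run of letters or apostrophes.
    concepts = re.findall(r"[a-z']+", text.lower())
    freq = Counter(concepts)
    return freq, len(freq), len(concepts)

text = "The dog ate the flowers. The dog was happy."
freq, unique, total = concept_list(text)

# Default view: decreasing frequency (ties broken alphabetically).
by_frequency = sorted(freq.items(), key=lambda kv: (-kv[1], kv[0]))
# Alternative view: alphabetical by concept.
alphabetical = sorted(freq.items())
```

Here unique counts each concept once, while total also counts repetitions, which mirrors the distinction made in the note above.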


Below is a screen capture showing part of the Concept List, sorted by
frequency, for the first document in our loaded folder of texts
(Text1-in-user's guide.txt):

[Screen capture: the Concept List in Panel P2. Callouts: click the Concept column
heading to sort concepts alphabetically; click the Frequency column heading to sort
concepts by frequency.]



Create  and  refresh  the  Union  Concept  List: 

The Union Concept List considers concepts in all texts in a set (unlike the
Concept List, which considers each text in the set individually). It is
located in Panel P2 under the tab titled 2. Union Concept List.

The  Union  Concept  List  tells  you  several  important  facts  about  your  text 
set: 
■ Concepts contained in all loaded text sets

■  Related,  cumulative  frequencies  of  concepts  in  all  text  sets 

■  Cumulative  unique  and  total  concepts 

Before  viewing  your  Union  Concept  List,  you  must  refresh  it.  To  refresh 
your  Union  Concept  List,  go  to  the  Menu  Bar,  choose  File  >  Refresh 
Union  Concept  List. 

The  Union  Concept  List  can  be  refreshed  after  each  step  of  pre-processing. 
This  will  allow  you  to  visualize  the  impact  of  pre-processing  operations  on 
the  union  of  concepts. 


Below  is  a  screen  capture  showing  where  to  find  the  Refresh  Union 
Concept  List  command: 


[Screen capture: the File menu, which contains the Create and Refresh Union Concept
List command]



After  refreshing  your  Union  Concept  List,  you  can  view  it  under  the 
2. Union  Concept  List  tab  in  Panel  P2.  Below  is  a  screen  capture  showing 
where  this  tab  is  located  in  the  GUI: 

By default, the list is sorted by decreasing frequency of concepts. To sort
the list alphabetically, click on the first-column header Concept. To re-sort
the list by frequency, click on the second-column header Frequency.

Note: The number of unique concepts considers each
concept only once. The number of total concepts also
considers repetitions of concepts.
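
The difference between a per-text Concept List and the Union Concept List can be illustrated with a short sketch. It assumes already-cleaned texts that can be split on whitespace; the function name union_concept_list is hypothetical and does not correspond to anything in AutoMap.

```python
from collections import Counter

def union_concept_list(texts):
    """Sum per-text concept frequencies across a whole text set."""
    union = Counter()
    for text in texts:
        # Assumption: whitespace tokenization of cleaned, lower-cased text.
        union.update(text.lower().split())
    return union

texts = ["the dog ate the flowers", "the dog was happy"]
union = union_concept_list(texts)
unique_concepts = len(union)          # each concept counted once
total_concepts = sum(union.values())  # repetitions included
```

Re-running the function after each pre-processing step would show how the union of concepts shrinks, which is the purpose of refreshing the list.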

V. Save your Union Concept List as a csv file

To  specify  a  location  for  your  Union  Concept  List  file,  choose  File  > 
Output  Storage  Manager. 

Below  is  a  screen  capture  showing  where  to  find  this  in  the  GUI: 


[Screen capture: the File menu, which contains the Output Storage Manager command]



A  window  will  pop  up  listing  all  pre-processing  output  storage.  This  is  the 
Output  Storage  Manager. 

The Union Concept List appears as the second item in the Output Storage
Manager. Click the Save file as... option to the right of the default file
path. In the file chooser window that pops up, browse to the location
you want.

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\UnionConceptList

Below is a screen capture showing where to find the correct Save file as...
button in the Output Storage Manager:


Click the Save Settings button at the bottom right corner of the Output
Storage Manager. Close the Output Storage Manager window by clicking
on the red X at the top right corner.

In  the  Automap  Menu  Bar,  choose  File  >  Save  Union  Concept  List. 
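
For readers who want to produce a comparable file by script, the sketch below writes a concept-frequency list to a csv file and reads it back. The header row (Concept, Frequency), the file location, and the function name are assumptions; the exact layout of the csv file that AutoMap writes is not documented here.

```python
import csv
import os
import tempfile
from collections import Counter

def save_union_concept_list(union, path):
    """Write concepts and frequencies to a csv file, most frequent first."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["Concept", "Frequency"])  # assumed header row
        ordered = sorted(union.items(), key=lambda kv: (-kv[1], kv[0]))
        for concept, count in ordered:
            writer.writerow([concept, count])

union = Counter({"dog": 4, "garden": 2, "roses": 2})
# Hypothetical output location in the system's temporary directory.
path = os.path.join(tempfile.gettempdir(), "UnionConceptList.csv")
save_union_concept_list(union, path)

# Read the file back to confirm what was written.
with open(path, newline="", encoding="utf-8") as handle:
    rows = list(csv.reader(handle))
```

A file written this way opens in a spreadsheet program with one concept per row.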

VI.  View  your  csv  file  in  Microsoft  Excel 

Navigate  to  the  location  of  your  new  csv  file  and  double-click  on  its  icon. 

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\UnionConceptList

A  box  titled  “Open  With”  pops  up.  Click  on  Microsoft  Excel,  then  click 
OK. 
Note: You must have MS Excel installed on your
computer in order to view your csv file this way.

VII.  Save  and  view  your  results  as  a  binary  csv  file 

Follow  Steps  3  and  4. 

VIII.  Carry  out  a  simple  Semantic  Network  Analysis 

A  Simple  Network  Analysis  involves  no  thesauri  and  uses  all  words.  A 
Semantic  Network  Analysis  can  be  run  on  original  input  texts  or  on  texts 
that  have  been  pre-processed  with  a  Delete  List  and/or  a  Generalization 
Thesaurus. 

Before  running  the  analysis: 

Make  sure  you  have  completed  the  following  steps: 

■  Pre-process  the  texts  if  you  wish 

■  Specify  the  analysis  settings  (otherwise  the  default  settings  will 
be  applied) 

■  Specify  additional  output  options  if  you  wish  to  (if  not,  the 
default  outputs  will  be  generated) 

About  the  analysis  settings: 

Use  the  Analysis  Settings  index  card  to  specify  the  analysis  settings.  Your 
settings  will  be  automatically  applied  to  the  analysis  (you  do  not  need  to 
confirm  them). 

Below  is  a  screen  capture  showing  where  to  find  the  Analysis  Settings 
index  card  in  P2: 
[Screen capture: the 4. Analysis Settings index card in P2, showing the Coding Ties
options (Directionality: Uni-directional or Bi-directional; Strength: Frequency), the
Punctuation options, and the Window Size chooser]


The  following  chart  lists  the  possible  analysis  settings: 


Coding Ties: Specify the way statements are counted.

■ Directionality: Select one of the following two possibilities by checking
the button.
o Uni-Directional (When coding a tie, only 1st -> 2nd concept should be noted)
o Bi-Directional (When coding a tie, both 1st <-> 2nd concept shall be noted)

■ Strength: Strength will be printed out by default. To not print out
Strength, uncheck the item.
o Frequency (The cumulative frequency of every existing statement.)
o Item not checked: Existence of frequency will be printed out (binary result).

Windowing: Windowing is a method that codes the text as a map by placing
relationships between pairs of Concepts that occur within a window.

■ Punctuation: Select one option by checking the radio button.
o Ignore punctuation completely (Statements will be placed between all
concepts.)
o Reset window at end of paragraphs only (Statements will be placed only
within every single paragraph.)
o Reset window at end of paragraphs and sentences (Statements will be
placed only within every single sentence.)

■ Window Size: Select one window size by using the number chooser.
o Window size between 2 and 100.
o The Window Size defines how distant concepts can be and still have a
relationship. Only concepts in the same window can form statements.



If  you  do  not  want  to  change  any  of  the  suggested  options,  the  analysis 
will  be  done  with  a  set  of  standard  (default)  settings. 
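
The effect of these settings can be illustrated with a small sketch. It is one plausible reading of windowing, not AutoMap's algorithm: each concept is linked to the concepts that follow it within the window, so a window size of 2 links adjacent concepts only. The function names, the parameter names, and the tokenizer are assumptions made for illustration.

```python
import re
from collections import Counter

def split_units(text, reset):
    """Split text into the units inside which windows may form."""
    if reset == "ignore":
        return [text]
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    if reset == "paragraphs":
        return paragraphs
    # reset == "sentences": break at paragraph ends and at . ! ?
    units = []
    for paragraph in paragraphs:
        units.extend(s for s in re.split(r"[.!?]+", paragraph) if s.strip())
    return units

def code_statements(text, window=2, directed=True, frequency=True, reset="ignore"):
    """Link each concept to the concepts that follow it inside the window."""
    statements = Counter()
    for unit in split_units(text, reset):
        concepts = re.findall(r"[a-z']+", unit.lower())
        for i, first in enumerate(concepts):
            for second in concepts[i + 1 : i + window]:
                statements[(first, second)] += 1
                if not directed and first != second:
                    statements[(second, first)] += 1  # note the tie both ways
    if not frequency:
        return {pair: 1 for pair in statements}  # binary result
    return dict(statements)

text = "dog ate roses. dog ate weeds."
ignoring = code_statements(text, window=2, reset="ignore")
per_sentence = code_statements(text, window=2, reset="sentences")
binary = code_statements(text, frequency=False)
both_ways = code_statements(text, directed=False, reset="sentences")
```

With punctuation ignored, the statement (roses, dog) is formed across the sentence boundary; resetting the window at sentence ends removes it.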

Below  is  a  screen  capture  showing  the  standard  settings: 


About  the  output  options: 

The  map  and  the  statistic  output  generated  by  AutoMap  are  displayed  in 
Panels  P3  and  P4,  respectively.  In  addition,  AutoMap  offers  further  output 
options  that  can  be  selected  in  the  10. Output  Options  index  card  in  Panel 
P2.  Any  other  additional  outputs  are  generated  after  analyses  are  run. 

Below  is  a  screen  capture  showing  the  10. Output  Options  index  card  in 
P2: 
[Screen capture: the Output Options index card, with check boxes for Term Distribution
Lists (Concepts analyzed; Concepts in statements and isolates; Statements), for Term
Distribution Matrices (concepts in statements by concepts in statements; term(s) by
text(s), each with a count or binary choice), and for Additional Output Options per File]



For  all  types  of  multiple  analysis,  term  distribution  lists  and  matrices  can 
be  selected  as  output  options  in  the  upper  two  fields  of  the  10. Output 
Options  index  card  in  P2. 

Points  to  consider: 

■  No  list  or  matrix  is  generated  by  default. 

■  Lists  or  matrices  are  only  generated  if  the  user  checks  the  item 
wanted  and  runs  an  analysis  (of  any  type). 

■ If pre-processing was performed, the list relates to the stage of
pre-processing that was used for the analysis. If several
pre-processing techniques are applied, analysis will run on the
highest stage of pre-processing.

■ The requested lists and matrices are automatically saved in a
folder called Term Distribution Lists and Matrices under the root
directory of AutoMap. This folder is overwritten with each new
analysis you run. To keep the results of an analysis, rename
that folder before you run the next one.

■  Two  output  lists  are  generated  for  each  Term  Distribution  List 
checked. 

■  One  output  matrix  is  generated  for  each  Term  Distribution 
Matrix  checked. 

The  following  chart  lists  the  types  and  content  of  Term  Distribution  Lists 
and  Matrices: 
Each entry below gives the output type, the name of the output file, and the
content of the output.

Term Distribution Lists

■ Concepts analyzed
o List of concept analyzed.CSV: Concept, Text, Frequency
o Statistics of concepts analyzed.CSV: Concept, Cumulated sum across text
set, Number of texts concept occurs in, Percentage of texts concept occurs
in, Texts

■ Concepts in statements and isolates
o List of concept in statements.CSV: Concept, Text, Frequency
o Statistics of concept in statements.CSV: Concept, Cumulated sum across
text set, Number of texts concept occurs in, Percentage of texts concept
occurs in, Texts
o List of isolates in statements.CSV: Concept, Text, Frequency
o Statistics of isolates.CSV: Concept, Cumulated sum across text set,
Number of texts concept occurs in, Percentage of texts concept occurs in,
Texts

■ Statements
o List of statements.CSV: Statement, Text, Frequency
o Statistics of statements.CSV: Statement, Cumulated sum across text set,
Number of texts statement occurs in, Percentage of texts statement occurs
in, Texts

Term Distribution Matrices

Concepts in statements by concepts in statements:

■ Concepts in statements
o Matrix of concepts in statements.CSV: Matrix of concepts that were linked
into statements (first row) by concepts that were linked into statements
(first column). If count was chosen, cells contain cumulated frequency of
concept. If binary was chosen, cells denote existence (1) or absence (0)
of concept.

Term(s) by text(s):

■ Concepts analyzed
o Matrix of concept analyzed.CSV: Matrix of concepts (union of concepts
listed in first row) by texts (all text names listed in first column). If
count was chosen, cells contain cumulated frequency of concept. If binary
was chosen, cells denote existence (1) or absence (0) of concept.

■ Concepts in statements and isolates
o Matrix of concept in statements.CSV: Matrix of concepts (union of
concepts listed in first row) by texts (all text names listed in first
column). If count was chosen, cells contain cumulated frequency of
concept. If binary was chosen, cells denote existence (1) or absence (0)
of concept.

■ Statements
o Matrix of statements.CSV: Matrix of statements (union of statements
listed in first row) by texts (all text names listed in first column). If
count was chosen, cells contain cumulated frequency of statement. If
binary was chosen, cells denote existence (1) or absence (0) of
statement.

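
The layout of a term-by-text matrix, and the difference between the count and binary options, can be sketched as follows. This is an illustration under stated assumptions rather than AutoMap's output routine: the function name, the whitespace tokenizer, and the label in the top-left cell are hypothetical.

```python
from collections import Counter

def term_by_text_matrix(texts, binary=False):
    """Union of concepts in the first row, one row per text below it."""
    counts = {name: Counter(body.lower().split()) for name, body in texts.items()}
    concepts = sorted(set().union(*counts.values()))
    rows = [["Text"] + concepts]  # assumed label for the top-left cell
    for name, freq in counts.items():
        cells = [freq[concept] for concept in concepts]
        if binary:
            # Existence (1) or absence (0) instead of frequency.
            cells = [1 if value else 0 for value in cells]
        rows.append([name] + cells)
    return rows

texts = {"Text1": "dog ate roses dog", "Text2": "dog happy"}
count_matrix = term_by_text_matrix(texts)
binary_matrix = term_by_text_matrix(texts, binary=True)
```

In the count matrix the cell for "dog" in Text1 is 2; in the binary matrix the same cell is 1.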


Save  your  upcoming  analysis  in  DyNetML  format: 

Before  running  your  Multiple  Map  Analysis,  you  must  specify  that  your 
results  will  be  in  DyNetML  format  so  that  the  results  can  eventually  be 
read  into  ORA. 


On  the  10. Output  Options  index  card  in  Panel  P2,  scroll  down  to  the 
option  titled  “Additional  Output  Formats”  and  check  the  per  Map  box  next 
to  “DyNetML  for  Map  Analysis.” 
Open the Output Storage Manager. (See section V.)

You will see “Text Analysis Output Directory” listed under the
“Analysis Output Storage” category. Click the Save in folder...
button on the far right in that row. In the folder chooser window that pops
up, navigate to the folder you want.

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\TextOutputFiles

Click  Open. 

Save  these  settings  and  close  the  Output  Storage  Manager.  (See  section 
V.) 

Run  the  Multiple  Map  Analysis: 

In  the  Menu  Bar,  choose  Run  Analysis  >  Multiple  Map  Analysis. 

The  results  will  be  displayed  on  the  Semantic  Network  of  Current  Text 
index  card  in  Panel  P3  and  on  the  2. Statistics  index  card  in  Panel  P4. 

Below  is  a  screen  capture  showing  the  results  of  Multiple  Map  Analysis  in 
Panels  P3  and  P4: 
If you have analyzed multiple texts, you can browse through the results
and see the related texts. Results of multiple analyses are automatically
saved in a folder titled “output” under the directory where AutoMap is
installed.

The output folders are overwritten with every new analysis you run. To
keep the results of the current analysis, rename its output folder before
you run the next one.

IX.  Open  ORA 

On  Empire: 

■  Navigate  to  C:\Documents  and  Settings\Carley\Desktop\Tools. 

■  Double-click  the  Run  ORA  icon.  The  ORA  interface  pops  up. 

From  anywhere  else:  (requires  a  web  connection) 
http://www.casos.cs.cmu.edu/projects/ora/software.html 
X. Load your files into ORA

Click the Load button on the far right. A file chooser window will pop up.

Choose Select Files. Navigate to your DyNetML files (generated in
section VIII).

Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\TextOutputFiles

Select  the  files  you  want  by  holding  down  the  <ctrl>  key  while  clicking  on 
each  one.  Click  Open,  then  Finish. 

Example:

1. Select Text1-in-user's guide.txt
2. Hold down the Control key
3. Select Text2-in-user's guide.txt

Note:  If  some  file  is  already  loaded  into  ORA,  choose  the 
Append  As  Additional  Meta-Matrix  option. 

XI.  Visualize  your  meta-matrices 

The  left-hand  portion  of  the  ORA  screen  is  the  ORA  navigator.  In  this 
navigator,  select  the  matrix  you  want  to  visualize  by  clicking  on  it.  Click 
the  Visualize  this  meta-matrix  button  near  the  center  of  the  screen  to 
generate  the  visualization.  Below  are  the  visualized  meta-matrices  for 
Text1-in-user's guide.txt and Text2-in-user's guide.txt:

[Visualizations of the two meta-matrices, the first captioned Text1-in-user's guide.txt]

These meta-matrices also can be found in:

C:\Documents and Settings\carley\Desktop\AutomapLesson1\MetaMatrices

XII.  Run  a  Semantic  Network  Report 

In  ORA's  Menu  Bar,  choose  Analysis  >  Generate  Reports.  A  window  will 
pop  up.  You  will  see  “Select  Report”  at  the  top  of  the  window.  Click  the 
V-shaped  icon  to  the  right,  and  select  Semantic  Network  from  the  resulting 
drop-down  menu. 


Below  is  a  screen  capture  showing  where  to  find  the  Semantic  Network 
Report  in  the  Generate  Reports  pop-up  window: 


Check the two boxes for Text1-in-user's guide.txt and Text2-in-user's
guide.txt inside the field titled “Select one or more meta matrices”.


To  specify  a  filename  for  the  results,  click  the  Browse  button  near  the 
bottom  right  corner  of  the  Generate  Reports  pop-up  window.  Navigate  to 
the  desired  location  and  type  in  a  filename  or  you  can  use  the  default 
name. 
Example: C:\Documents and Settings\carley\Desktop\AutomapLesson1\SemanticNetworkReport.html

Click  the  Next  button,  then  the  Finish  button  at  the  bottom  of  the  Generate 
Reports  pop-up  window. 

ORA  will  run.  An  HTML  file  will  pop  up  displaying  the  results. 

Return  to  the  ORA  window.  When  a  small  box  titled  “Complete”  has 
popped  up,  click  the  OK  button. 

Below is the Semantic Network Report generated for Text1-in-user's
guide.txt and Text2-in-user's guide.txt:

SEMANTIC-NETWORK REPORT

Input data: Text1-in-user's guide.txt, Text2-in-user's guide.txt
Start  time:  Tue  Mar  06  10:59:33  2007 

This  is  a  comparison  of  two  semantic  networks.  Each  node  in  the  network 
is  considered  a  Concept,  and  each  edge  a  Statement  connecting  two 
concepts.  Statement  weights  are  interpreted  as  the  number  of  times  the 
statement  occurred  in  the  underlying  input  text. 

Network                      Concepts  Statements  Density
Text1-in-user's guide.txt    74        200         0.0370233
Text2-in-user's guide.txt    69        184         0.0392157
Union                        98        310         0.032611
Intersection                 45        8           0.0040404

Symmetric  difference 

The  symmetric  distance  of  network  A  to  network  B  is  a  new  network  that 
contains  the  entities  in  A  that  are  not  in  B. 

Network                                                 Concepts  Statements
Text1-in-user's guide.txt to Text2-in-user's guide.txt  29        192
Text2-in-user's guide.txt to Text1-in-user's guide.txt  24        176
Saved output networks


The  summary  statistics  across  all  maps  saved  to:  C:\Documents  and 
Settings\jmcgille_summary_statistics.csv 

Produced  by  ORA  developed  at  CASOS  -  Carnegie  Mellon  University 
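
The figures in the report can be cross-checked by hand. The reported Density values are consistent with the density of a directed network without self-loops, that is, statements divided by concepts times (concepts minus 1). That ORA uses this formula is an inference from the four reported values, not something the report states. The function names below are hypothetical, and the small statement sets are made up for illustration.

```python
def density(concepts, statements):
    """Directed density without self-loops: statements / (n * (n - 1))."""
    return statements / (concepts * (concepts - 1))

def compare_networks(a, b):
    """Union, intersection, and one-way differences of two statement sets."""
    return {
        "union": a | b,
        "intersection": a & b,
        "a_not_b": a - b,  # entities in A that are not in B
        "b_not_a": b - a,  # entities in B that are not in A
    }

# Concept and statement counts copied from the report above.
reported = {
    "Text1": (74, 200, 0.0370233),
    "Text2": (69, 184, 0.0392157),
    "Union": (98, 310, 0.032611),
    "Intersection": (45, 8, 0.0040404),
}
recomputed = {name: density(n, m) for name, (n, m, _) in reported.items()}

# Made-up statement sets, only to show the set operations.
net_a = {("dog", "ate"), ("ate", "roses")}
net_b = {("dog", "ate"), ("dog", "happy")}
comparison = compare_networks(net_a, net_b)
```

The one-way differences also agree with the concept counts in the report: 74 minus the 45 shared concepts gives 29, and 69 minus 45 gives 24.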


Lesson  2 

Processing data: using the delete list
I. Open AutoMap
On  Empire: 

•  Navigate  to  C:\Documents  and  Settings\Carley\Desktop\Tools. 

•  Double-click  the  Run  Automap  icon.  The  Automap  Graphical  User  Interface 
(GUI)  pops  up. 

From  anywhere  else:  (requires  a  web  connection) 
http://www.casos.cs.cmu.edu/projects/automap/software.html 

II. Overview of the AutoMap Graphical User Interface (GUI)

Below  is  a  screen  capture  of  the  AutoMap  GUI: 
[Screen capture: the AutoMap GUI, with callouts for the Menus, the Browse Menu, the Index
Cards, and a Panel, and with the four panels labeled P1, P2, P3, and P4]

The AutoMap GUI is divided into four primary quadrants, or panels: P1 (top left), P2
(bottom left), P3 (top right), and P4 (bottom right).

A drop-down menu bar at the top of the window provides access to analysis tools and
utilities. The Browse Menu allows you to quickly navigate between texts you have loaded
into AutoMap.

The P2 panel can be edited, but the remaining panels (P1, P3, P4) cannot.
Information displayed in each panel is always related to the information displayed in the
other three panels. The Browse Menu relates to all four panels at the same time.

III.  Load  multiple  text  documents  into  AutoMap 

In  order  to  analyze  multiple  texts  simultaneously,  you  will  need  to  store  the  documents 
together  in  one  folder. 

Example: C:\Documents and Settings\carley\Desktop\TextInputFiles02-01-07

•  In  the  AutoMap  menu  bar,  select  File  >  Open  Multiple  Files.  A  folder  chooser 
window  will  pop  up. 

•  Navigate  to  the  folder  containing  the  documents  you  wish  to  analyze,  and 
single-click  on  it. 
Example: C:\Documents and Settings\carley\Desktop\TextInputFiles02-01-07

•  Click  the  Open  button  in  the  bottom  right  corner. 

The first text will be displayed in panel P1 under the tab titled 1. Original Texts. You can
use the Browse Menu to flip through the documents.

Below  is  a  screen  capture  of  the  first  text  display: 


[Screen capture: the first text displayed in panel P1. Callout: The Browse Menu allows you
to quickly navigate from one text sample to the next.]
IV. Carry out a simple content analysis

A  simple  content  analysis  determines  the  frequencies  of  all  words  in  a  text  document. 

Examine  the  Concept  List: 

The  Concept  List  considers  concepts  in  each  text  file  individually.  It  is  created 
automatically  when  a  text  or  set  of  texts  is  loaded  into  Automap,  and  is  displayed  in  panel 
P2  under  the  tab  titled  1. Concept  List. 
The Concept List tells you several important facts about each text in your set:

• The number of concepts found in the text displayed in panel P1 under the tab
titled 1. Original Texts

• Related frequencies of those concepts

• Cumulated unique concepts and total concepts contained in the data set

Note: The number of unique concepts considers each concept only
once. The number of total concepts considers repetitions of
concepts.

By default, the Concept List is sorted by decreasing frequency of concepts. To sort the
list alphabetically, click on the first-column header Concept. To re-sort the list by
frequency, click on the second-column header Frequency.

Below is a screen capture showing part of the Concept List, sorted by frequency, for the
first document in our loaded folder of texts (Text1-in-user's guide.txt):
[Screen capture: the Concept List. Callout: Click this column heading to sort
concepts by frequency.]

Create and refresh the Union Concept List:

The  Union  Concept  List  considers  concepts  across  all  texts  in  a  set  (unlike  the  Concept 
List,  which  considers  each  text  in  the  set  individually).  It  is  located  in  panel  P2  under  the 
tab  titled  2. Union  Concept  List. 

The  Union  Concept  List  tells  you  several  important  facts  about  your  text  set: 

•  Concepts  contained  in  all  loaded  text  sets 

•  Related,  cumulative  frequencies  of  concepts  in  all  text  sets 

•  Cumulative  unique  and  total  concepts 

Before  viewing  your  Union  Concept  List,  you  must  refresh  it.  From  the  Menu  Bar, 
choose  File  >  Refresh  Union  Concept  List. 

The  Union  Concept  List  can  be  refreshed  after  each  step  of  pre-processing  to  visualize 
the  impact  of  pre-processing  operations  on  the  union  of  concepts. 

Below  is  a  screen  capture  showing  where  to  find  the  Refresh  Union  Concept  List 
command: 
After refreshing your Union Concept List, you can view it under the 2. Union Concept
List tab in panel P2. Below is a screen capture showing where this tab is located in the
GUI:

[Screen capture: the 2. Union Concept List tab in panel P2]

By default, the list is sorted by decreasing frequency of concepts. To sort the list
alphabetically, click on the first-column header Concept. To re-sort the list by frequency,
click on the second-column header Frequency.

Note: The number of unique concepts considers each concept only
once. The number of total concepts also considers repetitions of
concepts.

V. The basics of pre-processing in AutoMap

Now  you  are  ready  to  pre-process  the  data,  or  reduce  it  so  that  it  includes  only  terms 
which  are  relevant  to  you. 

Here  are  some  points  to  consider: 

•  Semantic  Network  Analysis  can  be  run  without  any  prior  data  pre-processing 

• Meta-Matrix Text Analysis and Sub-Matrix Text Analysis require
pre-processing

VI. The basics of delete lists

Deletion removes non-content bearing conjunctions and articles from texts (Carley,
1993). Non-content bearing concepts to be deleted from the texts are denoted in a Delete
List. When applying a Delete List, AutoMap searches the text(s) for concepts specified in
the Delete List and deletes matches from the text(s).

For example:


•  Original  input  text:  The  New  York  City  Police  Department  said  a  number  of 
people  were  trapped  in  elevators  for  awhile 

•  Entries  in  the  Delete  List:  the,  a,  of,  were,  in,  for,  awhile 

•  Text  after  deletion:  New  York  City  Police  Department  said  number  people 
trapped  elevators 

The  Delete  List  is  NOT  case  sensitive. 
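
The example above can be reproduced with a minimal Python sketch. It is not AutoMap's code: the function name is hypothetical, and splitting the text on whitespace is an assumption made for illustration.

```python
def apply_delete_list(text, delete_list):
    """Remove every word that matches a Delete List entry, ignoring case."""
    entries = {entry.lower() for entry in delete_list}
    kept = [word for word in text.split() if word.lower() not in entries]
    return " ".join(kept)

original = ("The New York City Police Department said a number of "
            "people were trapped in elevators for awhile")
delete_list = ["the", "a", "of", "were", "in", "for", "awhile"]

result = apply_delete_list(original, delete_list)
print(result)
```

Because both the entries and the words are lower-cased before comparison, the entry "the" also removes "The" at the start of the sentence.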

You  can  use  the  predefined  Delete  Lists  that  AutoMap  offers,  or  create  your  own  Delete 
List. 


All  Delete  Lists  can  be  edited. 


VII.  Open  a  Delete  List 

Select  the  File  menu,  then  Open  Delete  List.  Choose  one  of  the  following  options: 

•  Open  from  file:  A  file  chooser  appears.  Select  a  delete  list  and  click  Open 

•  Open  small  predefined  Delete  List:  This  opens  AutoMap's  predefined  small 
delete  list 

• Open extensive predefined Delete List: This opens AutoMap's predefined extensive
delete list


Below  is  a  screen  capture  showing  where  to  access  the  Delete  List  utility: 

The Delete List is located in panel P2 in the AutoMap GUI, under the tab titled
3. Pre-Processing Settings. It appears as a sub-tab titled 3. Delete List.


Below  is  a  screen  capture  of  this  location: 

The Delete List can be edited by clicking inside the panel where the delete terms are
listed in the 3. Delete List index card.

The  types  of  Delete  Lists  available  in  AutoMap  are: 

•  Small  predefined  Delete  List 

This list is compiled from words that occur most frequently in English: a, an, and,
some, many, this, that, these, those, the, all, one, every.

•  Extensive  predefined  Delete  List 

This  is  based  on  words  occurring  most  frequently  in  English:  a,  an,  and,  as, 
at,  but,  for,  he,  her,  hers,  him,  his,  i,  it,  its,  me,  mine,  my,  nor,  of,  or,  our, 
she,  so,  that,  the,  their,  theirs,  them,  they,  to,  us,  we,  who,  whoever,  whom, 
whomever,  will,  would,  you,  your,  yours,  yourself.  As  the  name  indicates,  the 
Extensive  Delete  list  contains  more  words  than  the  Small  Predefined  Delete 
List. 

VIII.  Delete  concepts  and  examine  the  effects  on  the  text  and 
concept  list 

Note:  If  you  wish  to  apply  a  Delete  List  and  a  Thesaurus,  we 
recommend  applying  the  Delete  List  first,  followed  by  the  Thesaurus. 

When  applying  a  delete  list,  AutoMap: 

•  Searches  the  text  or  texts  for  concepts  specified  in  the  Delete  List 

•  Deletes  matching  concepts  from  the  texts 
• Displays the resulting texts in P1 under the 4. Texts After Deletion tab

Follow these steps to apply a delete list:

1.  Choose  an  adjacency  option  on  the  3. Delete  List  index  card.  The  types  of 
adjacency  are  the  following: 

o  Direct  Adjacency 

If  this  option  is  chosen,  concepts  in  the  text  that  match  concepts 
specified  in  the  delete  list  are  removed  from  the  texts.  As  a  result, 
concepts  on  either  side  of  the  deleted  concept  move  together  and  are 
treated  as  directly  adjacent  to  each  other  for  further  analysis  and 
visualization.

o Rhetorical Adjacency

If  this  option  is  chosen,  concepts  in  the  text  that  match  concepts 
specified  in  the  delete  list  are  removed  from  the  text,  but  placeholders 
("xxx")  are  inserted  to  hold  their  places.  The  placeholders  retain  the 
original  distances  of  the  maintained  concepts.  This  is  helpful  for  visual 
analysis. 

If  you  do  not  change  the  adjacency  option,  AutoMap  uses  direct  adjacency  as  the 
default. 

2.  Click  the  Apply  Delete  List  button  on  the  3. Delete  List  index  card.  This  deletes 
the  concepts  that  correspond  to  concepts  specified  in  the  Delete  List  from  all 
loaded  texts. 

3. View your newly pre-processed texts in panel P1 under the 4. Texts After
Deletion tab.

To  apply  multiple  delete  lists,  load  the  first  one  in  and  apply  it,  then  load  in  the  next  and 
apply  it,  and  so  on. 
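
The difference between the two adjacency options, and the effect of applying several delete lists in sequence, can be illustrated with the following sketch. The function and parameter names are hypothetical; only the "xxx" placeholder and the direct-adjacency default come from the description above.

```python
PLACEHOLDER = "xxx"

def delete_concepts(words, delete_list, adjacency="direct"):
    """Apply one Delete List to a list of words.

    direct:     matches are dropped, so their neighbours become adjacent.
    rhetorical: matches are replaced by a placeholder, preserving distances.
    """
    if adjacency not in ("direct", "rhetorical"):
        raise ValueError("adjacency must be 'direct' or 'rhetorical'")
    entries = {entry.lower() for entry in delete_list}
    result = []
    for word in words:
        if word.lower() in entries:
            if adjacency == "rhetorical":
                result.append(PLACEHOLDER)
        else:
            result.append(word)
    return result

words = "people were trapped in elevators".split()
direct = delete_concepts(words, ["were", "in"])
rhetorical = delete_concepts(words, ["were", "in"], adjacency="rhetorical")

# Several lists applied one after another: load and apply the first, then the next.
sequential = words
for current_list in (["were"], ["in"]):
    sequential = delete_concepts(sequential, current_list)
```

With direct adjacency, "people" and "trapped" end up next to each other; with rhetorical adjacency they remain two positions apart because a placeholder sits between them.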

IX.  Un-apply  a  Delete  List 

To un-apply a Delete List that was just applied to the data, go to the 3. Delete List index
card and click the Un-Apply Delete List button. In panel P1, the 4. Texts After Deletion
index card will be cleared.

X.  Save  an  applied  Delete  List 

To specify a location for your Delete List file, choose File in the Menu Bar, then
Output Storage Manager.

Below  is  a  screen  capture  showing  where  to  find  this  in  the  GUI: 

A window will pop up listing all pre-processing output storage. This is the Output
Storage Manager.

The Applied Delete List appears in the list on the left-hand side of the Output Storage
Manager. Click the Save file as... button to the right of the default file pathway. In the
file chooser window that pops up, browse to the pathway you want and type in a
filename.

Example: C:\Documents and Settings\carley\Desktop\Lesson 2\SampleDeleteList

Click the Save Settings button at the bottom right corner of the Output Storage Manager.
Close the Output Storage Manager window by clicking on the red X at the top right
corner.

In the AutoMap Menu Bar, choose File > Save Delete List. This saves the Delete List in
the location you specified in the Output Storage Manager.

XI.  Save  text(s)  after  application  of  Delete  List 

To  determine  where  your  saved  texts  will  end  up,  choose  File  >  Output  Storage  Manager 
(as  you  did  in  the  previous  step). 

A  window  will  pop  up  listing  all  pre-processing  output  storage.  This  is  the  Output 
Storage  Manager. 

Find the Texts After Deletion item in the list on the left-hand side of the Output Storage
Manager. Click the Save file as... button to the right of the default file pathway. In the
file chooser window that pops up, browse to the pathway you want.

Example: C:\Documents and Settings\carley\Desktop\Lesson 2\TextsAfterDeletion

Click the Save Settings button at the bottom right corner of the Output Storage Manager.
Close the Output Storage Manager window by clicking on the red X at the top right
corner.

In the AutoMap Menu Bar, choose File > Save Text(s) after Delete List Applied. This
saves your modified texts in the location you chose in the Output Storage Manager.

XII.  Modify  a  Delete  List 

By  clicking  inside  the  text  field  in  the  3. Delete  List  index  card,  you  can: 

•  Add  concepts:  Press  the  <Enter>  key  after  typing  in  a  concept  so  that  there 
is  only  one  concept  per  line 

•  Modify  concepts:  Go  to  the  desired  line  and  retype  the  concept 

•  Drop  concepts:  Highlight  the  concept  and  press  the  <Delete>  key 

XIII.  Create  a  delete  list  from  scratch  in  AutoMap 

There  are  two  ways  to  create  a  new  delete  list: 

Within  AutoMap 

•  Go  to  the  3. Delete  List  index  card  and  click  inside  the  text  field. 

•  Concepts  must  be  arranged  with  one  per  line.  To  do  this,  simply  press 
<Enter>  after  entering  each  concept. 

•  Avoid  empty  lines. 

• When you are finished adding concepts, click the Apply Delete List button.

Outside of AutoMap

•  Use  any  text  editor  to  create  a  Delete  List.  Keep  the  following  in  mind: 

•  The  general  structure  of  a  Delete  List  requires  one  concept  per  line. 

•  Avoid  empty  lines. 

•  The  Delete  List  is  not  case-sensitive. 

•  Save  the  Delete  List  by  going  to  File  >  Save  As,  and  typing  in  a  filename. 

Example: C:\Documents and Settings\carley\Desktop\Lesson 2\SampleDeleteList.txt

• Open your new Delete List in AutoMap: Go to File > Open Delete List > Open from
file, browse to your file, and click Open.
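
If you prefer to generate the file with a script rather than a text editor, the structural rules above (one concept per line, no empty lines) can be enforced as in the sketch below. The function name is hypothetical, and dropping entries that differ only in case is a convenience added here on the grounds that the Delete List is not case-sensitive.

```python
import tempfile
from pathlib import Path

def write_delete_list(concepts, path):
    """Write one concept per line, skipping blanks and case-insensitive repeats."""
    seen = set()
    lines = []
    for concept in concepts:
        cleaned = concept.strip()
        if cleaned and cleaned.lower() not in seen:
            seen.add(cleaned.lower())
            lines.append(cleaned)
    # No trailing newline, so the file contains no empty line at the end.
    Path(path).write_text("\n".join(lines), encoding="utf-8")
    return lines

# Usage: write to a temporary folder and read the file back.
with tempfile.TemporaryDirectory() as folder:
    target = Path(folder) / "SampleDeleteList.txt"
    written = write_delete_list(["the", "The", "", " a ", "of"], target)
    file_lines = target.read_text(encoding="utf-8").splitlines()
```

The resulting text file can then be opened in AutoMap in the same way as a list created by hand.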

To delete concepts using this new delete list and examine the effects on
the text and concept list, follow Step VIII.

To save a delete list you have created from scratch, follow Step X.

Example: C:\Documents and Settings\carley\Desktop\Lesson 2\AppliedDeleteList

XIV.  Open  a  chosen  delete  list  with  Microsoft  Excel 

To  open  your  delete  list  outside  of  AutoMap,  navigate  to  the  location  in  which  the  delete 
list  was  saved,  and  double-click  on  it.  A  window  reading  <Open  With>  pops  up.  Select 
Microsoft  Excel  and  click  OK.  Provided  Microsoft  Excel  is  installed  on  your  computer, 
the  file  will  open. 

Alternatively, you can copy and paste your delete list from the AutoMap GUI directly
into Microsoft Excel. Press <Control> and <A> simultaneously while in the 3. Delete List
panel (this selects all concepts in your delete list), then press <Control> and <C>
simultaneously (this copies the delete list). Go to Microsoft Excel and press <Control>
and <V> simultaneously (this pastes the delete list into Excel).

XV.  Modify  a  delete  list  in  Microsoft  Excel 

The  same  rules  that  applied  for  delete  lists  in  AutoMap  also  apply  for  lists  in  Excel: 

•  Use  any  text  editor  to  create  a  Delete  List.  Keep  the  following  in  mind: 

•  The  general  structure  of  a  Delete  List  requires  one  concept  per  line. 

•  Avoid  empty  lines. 

•  The  Delete  List  is  NOT  case-sensitive. 

•  Save  the  Delete  List  by  going  to  File  >  Save  As,  and  typing  in  a  filename. 

• Open your new Delete List in AutoMap: Go to File > Open Delete List > Open from
file, browse to your file, and click Open.

To un-apply a Delete List you have created from scratch and/or modified, follow Step IX.
To load in a modified delete list, follow Step VII.

To  delete  concepts  using  a  modified  delete  list  and  examine  the  effects  on  the  text  and 
concept  list,  follow  Step  VIII. 

Lesson 3

Processing data: using a Generalization Thesaurus

I. Open AutoMap

On Empire:

•  Navigate  to  C:\Documents  and  Settings\Carley\Desktop\Tools. 

• Double-click on the Run AutoMap icon. The AutoMap Graphical User Interface
(GUI) pops up.

From  anywhere  else:  (requires  a  web  connection) 
http://www.casos.cs.cmu.edu/projects/automap/software.html 

II. Overview of the AutoMap Graphical User Interface (GUI)

Below  is  a  screen  capture  of  the  AutoMap  GUI: 


The AutoMap GUI is divided into four primary quadrants, or panels: P1 (top left), P2
(bottom left), P3 (top right), and P4 (bottom right), as seen in the screen shot above.

A drop-down menu bar at the top of the window provides access to analysis tools and
utilities. The Browse Menu allows you to quickly navigate between documents you have
loaded into AutoMap.

The P2 panel can be edited, but the other panels (P1, P3, P4) cannot. Information
displayed on each panel is always related to the information displayed in the other three
panels. The Browse Menu relates to all four panels at the same time.

III.  Load  multiple  text  documents  into  AutoMap 

In  order  to  analyze  multiple  documents  simultaneously,  you  will  need  to  store  the 
documents  together  in  one  folder. 

Example:  C:\Documents  and  Settings\carley\Desktop\TextInputFiles02-01-07. 

•  In  the  AutoMap  menu  bar,  select  File,  then  Open  Multiple  Files.  A  folder 
chooser  window  will  pop  up. 

•  Navigate  to  the  folder  containing  the  documents  you  wish  to  analyze,  and 
single-click  on  it. 

Example: C:\Documents and Settings\carley\Desktop\TextInputFiles02-01-07

•  Click  the  Open  button  in  the  bottom  right  corner. 

The first text will be displayed in panel P1 under the tab titled 1. Original Texts. You can
use the Browse Menu to browse through the texts.

Below  is  a  screen  capture  of  the  first  text  display: 

IV.  Carry  out  a  simple  content  analysis 

A  simple  content  analysis  determines  the  frequencies  of  all  words  in  a  text  document. 

Examine  the  Concept  List: 

The Concept List considers concepts in each text file individually. It is created
automatically when a document or set of documents is loaded into AutoMap, and is
displayed in panel P2 under the tab titled 1. Concept List.

The Concept List tells you several important facts about each text in your set:

• The number of concepts found in the text displayed in panel P1 under the tab
titled 1. Original Texts

•  Related  frequencies  of  those  concepts 

•  Cumulated  unique  concepts  and  total  concepts  contained  in  the  data  set 

Note: The number of unique concepts considers each concept only
once. The number of total concepts considers repetitions of
concepts.

By default, the Concept List is sorted by decreasing frequency of concepts. To sort the
list alphabetically, click on the first-column header Concept. To re-sort the list by
frequency, click on the second-column header Frequency.

Below is a screen capture showing part of the Concept List, sorted by frequency, for the
first document in our loaded folder of texts (Text1-in-user's guide.txt):

Create  and  refresh  the  Union  Concept  List: 

The  Union  Concept  List  considers  concepts  across  all  texts  in  a  set  (unlike  the  Concept 
List,  which  considers  each  text  in  the  set  individually).  It  is  located  in  panel  P2  under  the 
tab  titled  2. Union  Concept  List. 

The  Union  Concept  List  tells  you  several  important  facts  about  your  text  set: 

•  Concepts  contained  in  all  loaded  text  sets 

•  Related,  cumulative  frequencies  of  concepts  in  all  text  sets 

•  Cumulative  unique  and  total  concepts. 

Before viewing your Union Concept List, you must refresh it. To refresh, choose File in
the Menu Bar, then Refresh Union Concept List.

The  Union  Concept  List  can  be  refreshed  after  each  step  of  pre-processing  to  visualize 
the  impact  of  pre-processing  operations  on  the  union  of  concepts. 

Below  is  a  screen  capture  showing  where  to  find  the  Refresh  Union  Concept  List 
command: 

After refreshing your Union Concept List, you can view it under the 2. Union Concept
List tab in panel P2. Below is a screen capture showing where this tab is located in the
GUI:

By default, the list is sorted by decreasing frequency of concepts. To sort the list
alphabetically, click on the first-column header Concept. To re-sort the list by frequency,
click on the second-column header Frequency.

Note: The number of unique concepts considers each concept only
once. The number of total concepts also considers repetitions of
concepts.

V. The basics of pre-processing

AutoMap  follows  a  hierarchy  of  pre-processing  techniques.  If  you  apply  a  pre-processing 
technique  of  a  lower  order  prior  to  a  technique  of  higher  order,  the  pre-processing  will  be 
maintained  through  all  following  procedures  of  higher  order.  If  needed,  you  can  un-apply 
each  technique  after  applying  it. 

If  you  wish  to  apply  multiple  pre-processing  techniques,  carry  out  the  process  in  the 
following  order: 

1.  Named-Entity  Recognition 

This  utility  does  not  impact  the  data.  It  can  be  used  before  any  type  of 
analysis  is  run.  It  can  be  used  before  or  after  Stemming. 

2.  Collocation/Bigram  Identification 

This  utility  does  not  impact  the  data.  It  can  be  used  before  any  type  of 
analysis  is  run. 

3.  Stemming 

This  can  be  used  before  any  type  of  analysis  is  run.  It  can  be  used  before  or 
after  Named-Entity  Recognition. 

4.  Deletion 

This  can  be  used  before  any  type  of  analysis  is  run. 

5.  Thesauri 

o  Generalization  Thesaurus:  This  can  be  applied  before  Semantic 
Network  Analysis  is  run.  It  can  be  applied  before  Meta-Matrix 
Thesaurus  is  applied. 

o  Meta-Matrix  Thesaurus:  This  has  to  be  applied  if  a  Meta-Matrix 
Analysis  will  be  run. 

o  Sub-Matrix  Selection:  This  can  only  be  performed  if  the  Meta-Matrix 
Thesaurus  was  applied.  It  must  be  applied  if  a  Sub-Matrix  Analysis  will 
be  run. 

This hierarchy is reflected in the numbering of the index card tabs in panels P1 and P2, to
make the sequence of the pre-processing steps more intuitive.
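
As a planning aid, the recommended order can be written down as a simple ordered list and used to sort whichever techniques you intend to apply. This is a hypothetical helper, not part of AutoMap; the list follows the numbering above (which permits Named-Entity Recognition and Stemming in either order), and the only dependency it checks is the stated one between Sub-Matrix Selection and the Meta-Matrix Thesaurus.

```python
PREPROCESSING_ORDER = [
    "Named-Entity Recognition",
    "Collocation/Bigram Identification",
    "Stemming",
    "Deletion",
    "Generalization Thesaurus",
    "Meta-Matrix Thesaurus",
    "Sub-Matrix Selection",
]

def plan_preprocessing(selected):
    """Return the selected techniques in the recommended order."""
    unknown = set(selected) - set(PREPROCESSING_ORDER)
    if unknown:
        raise ValueError(f"unknown technique(s): {sorted(unknown)}")
    if "Sub-Matrix Selection" in selected and "Meta-Matrix Thesaurus" not in selected:
        raise ValueError("Sub-Matrix Selection requires the Meta-Matrix Thesaurus")
    return [step for step in PREPROCESSING_ORDER if step in selected]

plan = plan_preprocessing({"Generalization Thesaurus", "Deletion", "Stemming"})
```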

Below is a screen capture showing the hierarchy of pre-processing techniques in panels
P1 (top) and P2 (bottom):

VI. Open a Generalization Thesaurus

From  the  Menu  bar  select  File  >  Open  Generalization  Thesaurus. 

The  thesaurus  will  be  displayed  in  panel  P2  under  the  3. Preprocessing  Settings  tab,  as  a 
sub-tab  titled  4. Generalization  Thesaurus. 

VII. Apply a thesaurus and examine its effect on the texts and
concept list

Note:  If  you  wish  to  apply  both  a  Delete  List  and  a  Generalization 
Thesaurus,  be  sure  to  use  first  the  Delete  List  and  then  the 
Thesaurus. 

When  applying  a  thesaurus,  you  have  the  following  options: 

Thesaurus  content  only 

If  this  option  is  chosen,  AutoMap: 

1.  Searches  the  texts  for  concepts  specified  in  the  thesaurus. 

2.  Translates  matches  into  key  concepts. 

3.  Maintains  only  key  concepts  in  the  pre-processed  texts.  The  remaining  input 
text  is  dropped  and  is  not  considered  in  further  pre-processing  or  analysis. 
The  original  distances  of  the  key  concepts  are  not  maintained,  but 
punctuation  marks  (such  as  at  the  ends  of  sentences  and  paragraphs)  are 
maintained  and  considered  in  analysis. 

4.  Results  in  all  key  concepts  in  the  text  appearing  directly  adjacent  to  each 
other. 

If this option is not chosen, AutoMap:

1.  Searches  the  texts  for  concepts  specified  in  the  thesaurus. 

2.  Translates  matches  into  key  concepts. 

3.  Keeps  the  rest  of  the  text  as  is.  This  means  that  all  other  concepts  in  the  text 
that  did  not  match  concepts  specified  in  the  thesaurus  will  not  be  affected  in 
any  way.  Original  distances  of  both  unaffected  concepts  and  key  concepts  are 
maintained  (unless  a  concept  consisting  of  more  than  one  word  was 
translated  into  a  key  concept). 

If  you  do  not  choose  “thesaurus  content  only”,  the  setting  will  not  be  applied.  If  you  do 
choose  “thesaurus  content  only”,  you  then  have  another  option: 

Direct  or  rhetorical  adjacency 

Direct  adjacency  means  that  original  distances  between  concepts  that  represent  the  key 
concepts  are  neither  visualized  nor  considered  for  analysis. 

Rhetorical adjacency means that the original distances between key concepts are retained
and incorporated into later analyses. The original distances are visually symbolized by
placeholders ("xxx").

To choose the direct adjacency option, click the Direct button in the Adjacency field on
the 4. Generalization Thesaurus index card under the 3. Preprocessing Settings tab in
panel P2. To choose the rhetorical adjacency option, click the Rhetorical button in the
same field.

If  you  do  not  change  the  adjacency  option,  AutoMap  uses  direct  adjacency  as  the  default. 
Also,  direct  adjacency  is  automatically  applied  if  the  “thesaurus  content  only”  option  is 
not  checked. 

To select the thesaurus content only option, check the box for Thesaurus content only on
the 4. Generalization Thesaurus index card in panel P2. You can then choose either direct
or rhetorical adjacency by clicking in the circles with those labels.

To  apply  your  chosen  settings,  click  the  Apply  button  on  the  4. Generalization  Thesaurus 
index  card  in  panel  P2. 

To  switch  back  to  not  using  the  “thesaurus  content  only”  option,  un-check  the  Thesaurus 
content  only  box  in  the  4. Generalization  Thesaurus  index  card  in  panel  P2,  then  apply  the 
Generalization  Thesaurus  again. 
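
The interaction of the "thesaurus content only" option with the adjacency setting can be made concrete with a short sketch. It is an illustration under stated assumptions, not AutoMap's algorithm: the function and parameter names are hypothetical, the input is assumed to be free of punctuation (so the handling of punctuation marks described above is not modeled), and trying longer concepts before shorter ones is a design choice made here.

```python
PLACEHOLDER = "xxx"

def apply_thesaurus(text, thesaurus, content_only=False, adjacency="direct"):
    """Translate thesaurus concepts into key concepts.

    thesaurus maps a concept (one or more words) to its key concept.
    Matching ignores case; longer concepts are tried before shorter ones.
    """
    lookup = {tuple(c.lower().split()): key for c, key in thesaurus.items()}
    longest = max((len(c) for c in lookup), default=0)
    words = text.split()
    output = []
    i = 0
    while i < len(words):
        for size in range(min(longest, len(words) - i), 0, -1):
            candidate = tuple(w.lower() for w in words[i:i + size])
            if candidate in lookup:
                output.append(lookup[candidate])
                i += size
                break
        else:
            # No thesaurus entry starts at this word.
            if not content_only:
                output.append(words[i])          # keep the rest of the text as is
            elif adjacency == "rhetorical":
                output.append(PLACEHOLDER)       # hold the dropped word's place
            i += 1
    return " ".join(output)

thesaurus = {"New York City": "NYC", "cops": "police"}
text = "The cops in New York City said"

kept_all = apply_thesaurus(text, thesaurus)
only_keys = apply_thesaurus(text, thesaurus, content_only=True)
with_gaps = apply_thesaurus(text, thesaurus, content_only=True,
                            adjacency="rhetorical")
```

Note that translating the three-word concept "New York City" into one key concept shortens the text, which is the exception to distance preservation mentioned above.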

VIII.  Un-apply  a  Generalization  Thesaurus 

To un-apply a Generalization Thesaurus that was applied to the data, go to the
4. Generalization Thesaurus index card and click the Un-Apply button. This clears the
5. Texts After Generalization tab in panel P1.

IX.  Modify  the  generalization  thesaurus  list 

You  can  modify  your  generalization  thesaurus  by  clicking  inside  the  text  field  on  the 
4. Generalization  Thesaurus  index  card  in  panel  P2.  Keep  in  mind  the  following  points: 

1.  Every  line  contains  Concept  /  Key  Concept  (in  other  words,  Old  Word  /  New 
Word.) 

2.  A  Concept  can  be  one  or  more  words. 

3.  A  Key  Concept  is  one  word. 

4.  Be  sure  to  separate  the  words  by  columns. 

5.  The  Thesaurus  is  not  case  sensitive. 
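
Before loading a hand-edited thesaurus, the points above can be checked automatically. The sketch below works on rows already read into memory as (concept, key concept) pairs, so it makes no assumption about the file format; the function name is hypothetical, and flagging a concept that is repeated with different capitalization is an extra check added here because the thesaurus is not case sensitive.

```python
def check_thesaurus(rows):
    """Return a list of problems found in (concept, key concept) rows."""
    problems = []
    seen = set()
    for number, row in enumerate(rows, start=1):
        if len(row) != 2:
            problems.append(f"row {number}: expected two columns")
            continue
        concept, key = (cell.strip() for cell in row)
        if not concept or not key:
            problems.append(f"row {number}: empty concept or key concept")
            continue
        if len(key.split()) != 1:
            problems.append(f"row {number}: key concept must be one word")
        if concept.lower() in seen:
            problems.append(f"row {number}: concept repeated (case ignored)")
        seen.add(concept.lower())
    return problems

rows = [
    ("New York City", "NYC"),      # multi-word concept, one-word key concept
    ("cops", "police"),
    ("Cops", "law enforcement"),   # repeated concept and two-word key concept
    ("elevators",),                # missing second column
]
problems = check_thesaurus(rows)
```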

To  apply  a  modified  thesaurus  and  examine  its  effects  on  the  texts  and  concept  list, 
follow  Step  VII. 

To  un-apply  a  modified  thesaurus,  follow  Step  VIII. 

X.  Create  a  generalization  thesaurus  list  from  scratch 

You  can  create  a  Generalization  Thesaurus  from  scratch  in  AutoMap. 

Find the 4. Generalization Thesaurus index card under the 3. Preprocessing Settings tab in
panel P2. You will see a text field, composed of two columns titled "concept" and
"generalization", that resembles a Microsoft Excel spreadsheet. To edit items inside this
text field, simply double-click on them.

AutoMap can help with the building of a generalization thesaurus by loading the Union
Concept List from the highest level of pre-processing applied into the Generalization
Thesaurus field.

To  load  in  the  highest-level  union  concept  list: 

1. Follow Step IV to create and refresh the Union Concept List

2.  Click  the  Load  Union  Concept  List  button  on  the  4. Generalization  Thesaurus 
index  card. 

Once  loaded  into  AutoMap,  the  Concept  List  can  be  refined  by  applying  Named-Entity 
Recognition  and  Deletion  prior  to  application  of  the  Generalization  Thesaurus.  (See 
Lessons  1  and  2). 

To  apply  the  thesaurus  you  have  made  from  scratch  and  examine  its  effect  on  the  texts 
and  concept  list,  follow  Step  VII. 

XI.  Save  the  Generalization  Thesaurus 

To specify a location for your Generalization Thesaurus file, select File in the Menu Bar,
then Output Storage Manager.

Below  is  a  screen  capture  showing  where  to  find  this  in  the  GUI: 

A  window  will  pop  up  listing  all  pre-processing  output  storage.  This  is  the  Output 
Storage  Manager. 

Find the Applied Generalization Thesaurus item in the list on the left-hand side of the
Output Storage Manager. Click the Save file as... button to the right of the default file
pathway. In the file chooser window that pops up, browse to the pathway you want and
type in a filename.

Example: C:\Documents and Settings\carley\Desktop\Lesson 3\AppliedGeneralizationThesaurus

Click the Save Settings button at the bottom right corner of the Output Storage Manager.
Close the Output Storage Manager window by clicking on the red X at the top right
corner.

In the AutoMap Menu Bar, choose File, then Save applied Generalization Thesaurus.

XII. Open the Generalization Thesaurus in Microsoft Excel

Outside of AutoMap, navigate to the Generalization Thesaurus you have saved and
double-click on it. A window titled "Open With" will pop up. Double-click on Microsoft
Excel. The file will open in Excel (provided you have Excel installed on your computer).

Alternatively, you can copy and paste your Generalization Thesaurus into Excel directly
from AutoMap. Single-click inside the text field on the 4. Generalization Thesaurus index
card in panel P2. Press <Control> and <A> simultaneously to select all the text,
followed by <Control> and <C> simultaneously to copy the selected text. Then go into a
blank Excel spreadsheet and press <Control> and <V> simultaneously to paste the
generalization thesaurus into the spreadsheet.

XIII. Modify the Generalization Thesaurus in Microsoft Excel

You  can  modify  your  generalization  thesaurus  in  Microsoft  Excel  just  as  for  a  standard 
spreadsheet.  Keep  in  mind  the  following  points: 

1.  Every  line  contains  Concept  /  Key  Concept  (in  other  words,  Old  Word  /  New 
Word.) 

2.  A  Concept  can  be  one  or  more  words. 

3.  A  Key  Concept  is  one  word. 

4.  Be  sure  to  separate  the  words  by  columns. 

5.  The  Thesaurus  is  not  case  sensitive. 

To  load  in  a  thesaurus  you  have  modified  in  Microsoft  Excel,  follow  Step  VI. 

To  apply  this  modified  thesaurus  and  examine  its  effects  on  the  texts  and  concept  list, 
follow  Step  VII. 

To  un-apply  this  modified  thesaurus,  follow  Step  VIII. 

Lesson  4 

AutoMap Usage "Snapshots"

1. Snapshot: Split Input Text Files

Purpose 

Split large text files into smaller ones that each contain at least a specified number of
words.

When  to  apply  it:  In  order  to  speed  up  AutoMap  coding. 

Input  from  user:  Number  of  words  (NW)  that  each  text  file  should  contain  after  splitting. 

How it works

Each text is split at the next sentence mark after the number of words that the user
specified. Thus, each resulting split text contains at least NW words.

Output 

N texts that each contain at least NW words, written to the directory specified by the
user. The resulting texts keep the original filename plus a counter, starting from 0 and
going up to N, where N indicates the largest number of texts that an original text was
split into.
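
The splitting rule can be sketched as follows. This is an illustration rather than the Text File Splitter's actual code: the function name is hypothetical, the set of sentence marks is assumed to be ".", "!" and "?", and merging a short remainder into the previous part (to keep every part at NW words or more) is an assumption about how leftover text is handled.

```python
SENTENCE_MARKS = (".", "!", "?")

def split_text(text, min_words):
    """Split text at the first sentence mark reached after min_words words."""
    if min_words < 1:
        raise ValueError("min_words must be at least 1")
    chunks, current = [], []
    for word in text.split():
        current.append(word)
        if len(current) >= min_words and word.endswith(SENTENCE_MARKS):
            chunks.append(" ".join(current))
            current = []
    if current:
        # Leftover words: merge a short remainder into the previous part
        # so that every part still has at least min_words words (assumption).
        if chunks and len(current) < min_words:
            chunks[-1] += " " + " ".join(current)
        else:
            chunks.append(" ".join(current))
    return chunks

text = "One two three. Four five. Six seven eight nine. Ten."
chunks = split_text(text, min_words=3)
```

The second sentence ("Four five.") is too short to form a part on its own, so the split is postponed to the next sentence mark after the word count reaches three.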

How  To 

Click  the  Tools  menu  and  select  Open  Text  File  Splitter.  Follow  the  directions  specified 
in  the  user  interface. 

2.  Snapshot:  Using  Compare  Maps 

Purpose 

•  Compare  individual  mental  maps  and  structures  of  social  and  organizational 
systems  extracted  from  texts  with  AutoMap  by  using  set  theory. 

•  Consolidate  individual  mental  maps  and  structures  of  social  and  organizational 
systems  extracted  from  texts  with  AutoMap  to  a  degree  specified  by  the  user. 
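
To illustrate what a set-theoretic comparison of maps means, the sketch below treats each map as a set of statements, where a statement is a pair of linked concepts. It does not describe CompareMap's implementation; the function names are hypothetical, and reading "to a degree specified by the user" as "statements that occur in at least a given number of maps" is only one possible interpretation.

```python
from collections import Counter

def compare_maps(map_a, map_b):
    """Compare two maps given as sets of (concept, concept) statements."""
    return {
        "shared": map_a & map_b,    # intersection
        "only_a": map_a - map_b,    # difference
        "only_b": map_b - map_a,
        "combined": map_a | map_b,  # union
    }

def consolidate(maps, min_maps):
    """Keep statements that occur in at least min_maps of the given maps."""
    counts = Counter(statement for m in maps for statement in m)
    return {statement for statement, n in counts.items() if n >= min_maps}

map_a = {("police", "elevators"), ("people", "trapped")}
map_b = {("people", "trapped"), ("power", "failed")}

comparison = compare_maps(map_a, map_b)
strict = consolidate([map_a, map_b], min_maps=2)   # shared by both maps
loose = consolidate([map_a, map_b], min_maps=1)    # found in any map
```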

How  To 

Click  the  Tools  menu  and  select  Open  CompareMap.  For  further  instructions  consult  the 
CompareMap  User's  Guide. 

3.  Snapshot:  Merge  DyNetML  Files 

Purpose 

Merge multiple DyNetML files into one DyNetML file.

Example

This might be needed, for example, when DyNetML files that were generated per text
during Sub-Matrix Text Analysis need to be consolidated into one DyNetML file that
represents the entire text set.

Output

One DyNetML file.

How To


Click  the  Tools  menu  and  select  Open  DyNetML  File  Merger.  Follow  the  directions 
specified  in  the  user  interface. 

4.  Snapshot:  Convert  Network  Data  Formats 

Purpose 

Convert a file in a specific network data format (CSV, DL, UCINET, DyNetML, VNA)
into another network data format.

How  To 

Click the Tools menu and select Open Network Data Format Converter. Follow the
directions specified in the user interface.

5. Snapshot: Edit Network Data

Purpose

Edit relational data.

How To

Click the Tools menu and select Open Matrix Editor. Follow the directions specified in
the user interface.

6.  Snapshot:  Visualize  Semantic  Networks 

Purpose 

Visualize  mental  models  and  social  structure. 

How  To 

• Within AutoMap: Click the Tools menu and select Open SocialInsight Visualizer.
Example: Load in DyNetML files created in AutoMap.

• With External Tools: Convert DyNetML or DL files generated with AutoMap into
the formats required by the external tool you want to use.

Example: DL files converted into VNA files can be visualized in NetDraw. In NetDraw,
open the VNA file: File > Open > VNA text file > complete.